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POLYNUCLEOTIDE AND ITS USE FOR MODULATING A DEFENCE RESPONSE IN PLANTS 

The present invention relates to stimulating a defence 
response in plants, with a view to providing the plants with 
enhanced pathogen resistance. More specifically, it has 
resulted from cloning of the barley Mlo gene, various mutant 
mlo alleles, and a number of homologues from various species. 
The Mlo gene has been isolated using a positional cloning 
approach which has never previously been successful in Barley. 
Details and discussion are provided below. Wild- type Mlo 
exerts a negative regulatory function on a pathogen defence 
response, such that mutants exhibit a defence response in the 
absence of pathogen. In accordance with the present invention, 
down -regulation or out -competition of Mlo function may be used 
to stimulate a defence response in transgenic plants, 
conferring increased pathogen resistance. 

Mutations have been described in several plants in which 
defence responses to pathogens appear to be constitutively 
expressed. Mutation- induced recessive alleles imlo) of the 
barley Mlo locus exhibit a leaf lesion phenotype and confer an 
apparently durable, broad spectrum resistance to the powdery 
mildew pathogen, Erysiphe gramlnxs f sp horde! . 

Resistance responses to the powdery mildew pathogen have 
been genetically well characterized (Wiberg, 1974; Sagaard and 
J0rgensen, 1988; Jorgensen, 1994) . In most analyzed cases 
resistance is specified by race-specific resistance genes 
following the rules of Flor's gene-f or-gene hypothesis (Flor, 
1971). In this type of . plant /pathogen interaction, resistance 
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is specified by and dependent on the presence of two 
completnentary genes, one from the host and one from the fungal 
pathogen. The complementary genes have been termed 
operationally (pathogen) resistance C'-R") gene and avirulence 
5 gene, respectively. Most of the powdery mildew resistance genes 
KMlx) act as dominant or semidominant traits (Jergensen, 1994) . 

Monogenic resistance mediated by recessive (mlo) alleles 
of the Mia locus is different. Apart from being recessive, it 
differs from race -specific resistance co single pathogen 

10 strains in that (i) it confers broad spectrum resistance to 

almost all known isolates of the pathogen (ii) mla resistance 
alleles have been obtained by mutagen treatment of any tested 
susceptible wild type (Mid) variety, and (iii) mlo resistance 
alleles exhibit a defence mimic phenotype in the absence of the 

15 pathogen (Wolter et ai . , 1993). Thus, the genetic data 

indicate the Mla wild type allele exerts a negative regulatory 
function on defence responses to pathogen attack. 

Resistance mediated by inlo alleles is currently widely 
used in barley breeding and an estimated 10 million hectares 

20 are annually planted in Europe with seeds of this genotype. A 
*jnio like' inherited resistance to powdery mildew in other 
cereal plants has not been reported so far although the fungus 
is a relevant pathogen in wheat (attacked by Bi:yB±phG cframlnis 
f sp tritici) , oat (attacked by E. g. f sp a venae) , and rye 

25 (attacked by B, g. f sp secalis) . Because cereals are 

morphologically, genetically and biochemically highly related 
to each other (Moore et al . , 1995), one would predict the 
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existence of homologous genes in these species. The failure to 
have found a ^mlo like' inherited resistance in wheat and oat 
is probably due to their hexaploid genomes/ making it difficult 
to obtain by mutagenesis defective alleles in all six gene 
copies, and the chance of all such mutations occurring in 
Nature is remote. The failure to have found a mlo equivalent 
in other cereals is probably due to insignificant amount of 
mutational analysis in these species and complications as a 
result of their outbreeding nature {e.g. rye). 

RFLP markers closely linked to Mlo on barley chromosome 4 
were previously identified on the basis of a mlo backcross line 
collection containing mlo alleles from six genetic backgrounds 

(Hinze et al • , 1991) - The map position of Mlo on the basis of 
RFIjP markers was consistent with its chromosomal localization 

as determined by a previous mapping with morphological markers 

(Jorgensen, 1977) . 

Having identified an -3cM genetic interval containing Mlo 

bordered by genetic markers, we decided to attempt to isolate 

the gene via positional cloning. 

However, there is no documented example of a successful 

positional cloning attempt of a barley gene. We were faced 

with a number of difficulties. 

Q 

Firstly, the genome of barley (5.3x10 bp/haploid genome 
equivalent; Bennett and Smith, 1991) has almost double the size 
of the human genome and because the total genetic map covers 
-1.800 CM (Becker et al . , 1995) we were confronted with a very 
unfavourable ratio of genetic and physical distances (1 cM 
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corresponds to - 3 Mb) . 

Secondly, a high resolution genetic map had to be 
constructed around Mlo enabling the positioning of linked 
markers with a precision of better than 0.1 cM. 

Thirdly, we aimed to physically delimit the target gene 
and both flanking DNA markers on individual large insert 
genomic clones, a procedure later termed "chromosome landing" 
(Tanksley et al . , 1995). For this purpose, a complete barley 
YAC library from barley Megabase DNA had to be constructed with 
an average insert size of 500-600 kb, which was unprecedented. 

Fourthly, we had to prepare unusual genetic tools that 
enabled us to identify the Mlo gene within a physically 
delimited region without the need for a time consuming 
generation of barley transgenic plants and testing of different 
candidate genes. We used for our studies ten characterized 
radiation- or chemically- induced mlo mutants (Jergensen, 1992) . 
For a conclusive chain of evidence of the gene isolation we 
decided to depend upon a functional restoration of the wild 
type Mlo allele starting out from characterized mlo defective 
alleles- For this purpose, we performed mlo heteroallelic 
crosses and isolated susceptible intragenic Aflo recombinants - 
The sequence analysis of these proves the function of the 
described gene . 

The cloning of the barley Mlo gene and homologues, 
including homologues from other plant species, gives rise to a 
number of practical applications, reflected in the various 
aspects of the present invention. 
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According to a first aspect of the present invention there 
is provided a nucleic acid molecule comprising a nucleotide 
sequence encoding a peptide with Mlo function. Those skilled in 
the art will appreciate that "Mlo function" refers to the 
ability to suppress a defence response, said defence response 
being race and/or pathogen independent and autonomous of the 
presence of a pathogen, such as, for example, the Mlo gene of 
barley, the Acd gene and the Led gene of Arabxdopsis. 

mlo. mutations that down-regulate or disrupt functional 
expression of the wild-type Mlo sequence are recessive, such 
that they are complemented by expression of a wild-type 
sequence. Thus '^Mlo function" can be determined by assessing 
the level of constitutive defence response and/or 
susceptibility of the plant to a pathogen siich as, for example, 
powdery mildew or rust (e.g. yellow rust). Accordingly, a 
putative nucleotide sequence with Mlo function can be tested 
upon complementation of a suitable mlo mutant. The term '^mlo 
function" is used to refer to sequences which confer a mlo 
mutant phenotype on a plant . 

The capitalisation of "Mlo" and non-capitalisation of 
"mlo" is thus used to differentiate between "wild- type" and 
"mutant " function . 

A wlo mutant phenotype is characterised by the exhibition 
of an increased resistance against one or more pathogens, which 
is race and/or pathogen independent and autonomous of the 
presence of a pathogen. 

The test plant may be monocotyledonous or dicotyledonous. 



wo 98A)4586 



PCT/GB97/02046 



6 

Suitable monocots include any of barley, rice, wheat, maize or 
oat, particularly barley. Suitable dicots include Arahidopsis . 

Nucleic acid according to the invention may encode a 
polypeptide comprising the amino acid sequence shown in Figure 
2, or an allele, variant, derivative or mutant, or homologue, 
thereof . 

Nucleic acid according to the present invention may have 
the sequence of a Aflo gene of barley, or be a mutant, variant 
(or derivative) or allele of the sequence provided, or a 
homologue thereof. Preferred mutants/ variants and alleles are 
those which encode a sequence which retains a functional 
characteristic of the wild- type gene, especially the ability to 
suppress a defence response as discussed herein. Other 
preferred mutants, variants and alleles encode a sec[uence 
which, in a homozygote, cause constitutive activation of a 
defence response, or at least promotes activation of a defence 
response (i.e. is a mlo mutant sequence), e.g. by reducing or 
wholly or partly abolishing Mlo function. Preferred mutations 
giving mlo mutant sequences are shown in Table 1 . Changes to a 
sequence, to produce a mutant, derivative or variant, may be by 
one or more of addition, insertion, deletion or substitution of 
one or more nucleotides in the nucleic acid, leading to the 
addition, insertion, deletion and/or substitution of one or 
more amino acids. Of course, changes to the nucleic acid which 
make no difference to the encoded amino acid sequence are 
included. Particular variants, mutants, alleles and 



wo 98/04586 



PCT/GB97/02(M6 



7 " 

derivatives are discussed further below, as well as hotnologues. 

A preferred nucleic acid sequence according to an aspect 
of the present invention is shown in Figure 2 along with the 
predicted amino acid sequence. Nucleic acid may be subject to 
alteration by way of substitution of nucleotides and/or a 
combination of addition, insertion and/or substitution of one 
or more nucleotides with or without altering the encoded amino 
acids sequence (by virtue of the degeneracy of the genetic 
code) . 

As discussed below, further aspects of the present 
invention provide homologues of the Mlo sequence shown in 
Figure 2, including from rice (genomic sequence Figure 5, 
bottom line, cDNA sequence Figure 10, amino acid sequence 
Figure 13) and barley (genomic sequence Figure 6, bottom line. 
CDNA sequence Figure 11, amino acid sequence Figure 14); also 
Table 5B (nucleotide sequences) and Figure 5A (amino acid 
sequences) show homologous EST' s from rice and ArabidopsxB. 

The present invention also provides a vector which 
comprises nucleic acid with any one of the provided sequences, 
preferably a vector from which a product can be expressed. The 
vector is preferably suitable for transformation into a plant 
cell and/or a microbial cell. The invention further encompasses 
a host cell transformed with such a vector, especially a plant 
cell or a microbial cell (e.g. AgrobBCCerium tuwe faciens) . 
Thus, a host cell, such as a plant cell, comprising nucleic 
acid according to the present invention is provided. Within the 
cell, the nucleic acid may be incorporated within the nuclear 



r. 
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genome, i.e. a chromosotne . There may be more than one 
heterologous nucleotide sequence per haploid genome. 

A vector comprising nucleic acid according to the present 
invention need not include a promoter, particularly if the 
5 vector is to be used to introduce the nucleic acid into cells 
for recombination into the genome. 

Nucleic acid molecules and vectors according to the 
present invention may be provided in a form isolated and/or 
purified from their natural environment, in substantially pure 

10 or homogeneous form, or free or substantially free of nucleic 
acid or genes of the species of interest or origin other than 
the relevant sequence. Nucleic acid according to the present 
invention may comprise cDNA, RNA, genomic DNA and may be wholly 
or partially synthetic. The term ''isolate" may encompass all 

15 these possibilities. 

The present invention also encompasses the expression 
product of any of the nucleic acid sequences disclosed and 
methods of making the expression product by expression from 
encoding nucleic acid therefore under suitable conditions in 

20 suitable host cells, e.g. K. coll. Those skilled in the art 
are well able to construct vectors and design protocols for 
expression and recovery of products of recombinant gene 
expression. Suitable vectors can be chosen or constructed, 
containing one or more appropriate regulatory sequences, 

25 including promoter sequences, terminator fragments, 

polyadenylation sequences, enhancer sequences, marker genes and 
other sequences as appropriate. For further details see, for 
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example. Molecular Cloning: a Laboratory Manual: 2nd edition, 
Sambrook et al, 1989, Cold Spring Harbor Laboratory Press. 
Transformation procedures depend on the host used, but are well 
known. Meuiy known techniques and protocols for manipulation of 
nucleic acid, for example in preparation of nucleic acid 
constructs, mutagenesis, sequencing, introduction of DNA into 
cells and gene expression, and analysis of proteins, are 
described in detail in Short Protocols in Molecular Biology, 
Second Edition, Ausubel et al. eds., Johii Wiley & Sons, 1992. 
The disclosures of Sambrook et al . and Ausubel et al . are 
incorporated herein by reference, along with all other 
documents mentioned. 

Purified Mlo protein, or a fragment, mutant or variant 
thereof, e.g. produced recombinant ly by expression from 
encoding nucleic acid therefor, may be used to raise antibodies 
employing techniques which are standard in the art. Antibodies 
and polypeptides comprising antigen -binding fragments of 
antibodies may be used in identifying homologues from other 
species as discussed further below. 

Methods of producing antibodies include immunising a 
mammal (eg human, mouse, rat, rabbit, horse, goat, sheep or 
monkey) with the protein or a fragment thereof. Antibodies may 
be obtained from immunised animals using any of a variety of 
techniques known in the art, and might be screened, preferably 
using binding of antibody to antigen of interest. For 
instance. Western blotting techniques or immunoprecipitation 
may be used (Armitage et al, 1992, Nature 357-. 80-82). 
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Antibodies may be polyclonal or monoclonal. 

As an alternative or supplement to immunising a mammal, 
antibodies with appropriate binding specificity may be obtained 
from a recombinantly produced library of expressed 
5 immunoglobulin variable domains, eg using lambda bacteriophage 
or filamentous bacteriophage which display functional 
immunoglobulin binding domains on their surfaces; for instance 
see WO92/01047. 

T^tibodies raised to a polypeptide or peptide can be used 

10 in the identification and/or isolation of homologous 

polypeptides, and then the encoding genes. Thus, the present 
invention provides a method of identifying or isolating a 
polypeptide with Mlo or mlo function (in accordance with 
embodiments disclosed herein) , comprising screening candidate 

15 peptides or polypeptides with a polypeptide comprising the 
antigen-binding domain of an antibody (for example whole 
antibody or a fragment thereof) which is able to bind an Mlo or 
mlo peptide, polypeptide or fragment, variant or variant 
thereof or preferably has binding specificity for such a 

20 peptide or polypeptide, such as having an amino acid sequence 
identified herein. Specific binding members such as antibodies 
and polypeptides comprising antigen binding domains of 
antibodies that bind and are preferably specific for a Mlo or 
mlo peptide or polypeptide or mutant, variant or derivative 

25 thereof represent further aspects of the present invention, as 
do their use and methods which employ them. 

Candidate peptides or polypeptides for screening may for 
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instance be the products of an expression library created using 
nucleic acid derived from an plant of interest, or may be the 
product of a purification process from a natural source. 

A peptide or polypeptide found to bind the antibody may be 
isolated and then may be subject to amino acid sequencing. Any 
suitable technique may be used to sequence the peptide or 
polypeptide either wholly or partially (for instance a fragment 
of a polypeptide may be sequenced) . Amino acid sequence 
information may be used in obtaining nucleic acid encoding the 
peptide or polypeptide, for instance by designing one or more 
oligonucleotides (e.g. a degenerate pool of oligonucleotides) 
for use as probes or primers in hybridisation to candidate 
nucleic acid, or by searching computer sequence databases, as 
discussed further below. 

A further aspect of the present invention provides a 
method of identifying and cloning MJo homologues from plants, 
including species other than Barley, which method employs a 
nucleotide sequence derived from that shown in Figure 2. 
Further similar aspects employ a nucleotide sequence derived 
from any of the other Figures provided herein. Nucleic acid 
libraries may be screened using techniques well known to those 
skilled in the art and homologous sequences thereby identified 
then tested. The provision of sequence information for the Mlo 
gene of Barley and various homologues enables the obtention of 
homologous sequences from Barley and other plant species, as 
exemplified further herein. 

Also, one can easily derive PGR primers based on putative 
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exon sequences, which might be identified by comparison with 
the Mlo sequence provided in Figure 2 wherein exons are 
highlighted, and perform RT-PCR with total RNA from the plant 
of interest, e.g. barley and rice for the homologues shown in 
5 Figures 5 and 6 , with cDNA and amino acid sequences shown in 
other figures herein. 

The homologues whose nucleotide sequences are given and 
whose amino acid sequences are given or are deducible represent 
and provide further aspects of the present invention in 
10 accordance with those disclosed for the Barley gene shown in 
•Figure 2 . 

The present invention also extends to nucleic acid 
encoding a Mia homologue obtained using a nucleotide sequence 
derived from that shown in Figure 2 , or the amino acid sequence 

15 shown in Figure 2. Preferably, the nucleotide sequence and/or 
amino acid sequence shares homology with the sequence encoded 
by the nucleotide sequence of Figure 2, preferably at least 
about 50%, or at least. about 55%, or at least about 60%, or at 
least about 65%, or at least about 70%, or at least about 75%, 

20 or at least about 80% homology, or at least about 85% homology, 
or at least about 90% homology, most preferably at least about 
95% homology. **Horaology" in relation to an amino acid sequence 
may be used to refer to identity or similarity, preferably 
identity. High levels of amino acid identity may be limited to 

25 functionally significant domains or regions. 

A mutant, allele, variant or derivative amino acid 
sequence in accordance . with the present invention may include 
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within the sequence shown in Figure 2, a single amino acid 
change with respect to the sequence shown in Figure 2. or 2, 3, 
4. 5, 6, 7, 8, or 9 changes, about 10, 15, 20, 30, 40 or 50 
changes, or greater than about 50, 60, 70, 80 or 90 changes. 
In addition to one or more changes within the amino acid 
sequence shown in Figure 2, a mutant, allele, variant or 
derivative amino acid sequence may include additional amino 
acids at the C- terminus and/or N-terminus. 

As is well-understood, homology at the amino acid level is 
generally in terms of amino acid similarity or identity. 
Similarity allows for "conservative variation", i.e. 
substitution of one hydrophobic residue such as isoleucine, 
valine, leucine or methionine for another, or the substitution 
of one polar residue for another, such as arginine for lysine, 
glutamic for aspartic acid, or glutamine for asparagine. 
Similarity may be as defined and determined by the TBLASTN 
program, of Altschul et al . (1990) J. Mol . Biol. 215: 403-10, 
which is in standard use in the art, or, and this may be 
preferred, the standard program BestFit, which is part of the 
Wisconsin Package, Version 8, September 1994. (Genetics 
Computer Group. 575 Science Drive. Madison. Wisconsin. USA, 
Wisconsin 53711) . BestFit makes an optimal alignment of the 
best segment of similarity between two sequences. Optimal 
alignments are found by inserting gaps to maximize the number 
of matches using the local homology algorithm of Smith and 
Waterman 

Homology may be over the full-length of the relevant 
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sequence shown herein, or may more preferably be over a 
contiguous sequence of about or greater than about 20, 25, 30, 
33, 40, 50, 67, 133, 167, 200, 233, 267, 300, 333, 400, 450, 
500, 550, 6 00 or more amino acids or codons, compared with the 
relevant amino acid sequence or nucleotide sequence as the case 
may be . 

The EST sequences provided herein, have on average 70% 
similarity and 50% identity with the Mlo amino acid sequence of 
Figure 2, We show that the rice homologue (Figure 5) and 
barley homologue (Figure 6) have an amino acid identity of 81% 
(amino acid sequences shown in Figure 13 and Figure 14) - 

In certain Embodiments, an allele, variant, derivative, 
mutant or homologue of the specific sequence may show little 
overall homology, say about 20%, or about 2 5%, or about 30%, or 
about 35%, or about 40% or about 45%, with the specific 
sequence. However, in functionally significant domains or 
regions the amino acid homology may be much higher. Putative 
functionally significant domains or regions can be identified 
using processes of bioinf ormatics , including comparison of the 
sequences of homologues . Functionally significant domains or 
regions of different polypeptides may be combined for 
expression from encoding nucleic acid as a fusion protein. For 
example, particularly advantageous or desirable properties of 
different homologues may be combined in a hybrid protein, such 
that the resultant expression product, with Mlo or mlo 
function, may comprise fragments of various parent proteins. 
The nucleotide sequence information provided herein, or 
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any part thereof, may be used in a data-base search to find 
homologous sequences, expression products of which can be 
tested for Mlo or mlo function. These may have ability to 
complement a mlo mutant phenotype in a plant or may, upon 
expression in a plant, confer a mlo phenotype. 

In public sequence databases we recently identified 
several homologues for the sequence of Figure 2. We have 
already found homologues in rice and barley, and the dicot • 
AraJbl dopsl s . 

By sequencing homologues, studying their expression 
patterns and examining the effect of altering their expression, 
genes carrying out a similar function to Mlo in Barley are 
obtainable. Of course, mutants, variants and alleles of these 
sequences are included within the scope of the present 
invention in the same terms as discussed above for the Barley 
gene . 

Homology between the homologues as disclosed herein, may 
be exploited in the identification of further homologues, for 
example using oligonucleotides (e.g. a degenerate pool) 
designed on the basis of sequence conservation. 

According to a further aspect, the present invention 
provides a method of identifying or a method of cloning a Mlo 
homologue, e.g. from a species other than Barley, the method 
employing a nucleotide sequence derived from that shown in 
Figure 2 or that shown in any of the other Figures herein. For 
instance, such a method may employ an oligonucleotide or 
oligonucleotides which comprises or comprise a sequence or 
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sequences that are conserved between the sequences of Figures 2 
and/or 5 and/or 6 and/or 10 and/or 11 and/or 12, or encoding an 
amino acid sequence conserved between Figure 2 and/or 7 and/or 
13 and/or 14 and/or 15 to search for homologues . Thus, a 
method of obtaining nucleic acid is provided, comprising 
hybridisation of an oligonucleotide or a nucleic acid molecule 
comprising such an oligonucleotide to target /candidate nucleic 
acid. Target or candidate nucleic acid may, for example, 
comprise a genomic or cDNA library obtainable from an organism 
known to contain or suspected of containing such nucleic acid, 
either monocotyledonous or dicotyledonous • Successful 
hybridisation may be identified and target/candidate nucleic 
acid isolated for further investigation and/or use. 

Hybridisation may involve probing nucleic acid and 
identifying positive hybridisation under suitably stringent 
conditions (in accordance with known techniques) and/or use of 
oligonucleotides as primers in a method of nucleic acid 
amplification, such as PGR. For probing, preferred conditions 
are those which are stringent enough for there to be a simple 
pattern with a small number of hybridisations identified as 
positive which can be investigated further. It is well known 
in the art to increase stringency of hybridisation gradually 
until only a few positive clones remain. 

As an alternative to probing, though still employing 
nucleic acid hybridisation, oligonucleotides designed to 
amplify DNA sequences may be used in PGR reactions or other 
methods involving amplification of nucleic acid, using routine 
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procedures. See for instance "PGR protocols; A Guide to 
Methods and Applications", Eds. Innis et al, 1990, Academic 
Press, New York. 

Preferred amino acid sequences suitable for use in the 
5 design of probes or PGR primers for some purposes are sequences 
conserved (completely, substantially or partly) between at 
least two Mlo peptides or polypeptides encoded by genes able to 
suppress a defence response in a plant, e.g. with any of the 
amino acid sequences of any of the various figures herein 
10 and/or encoded by the nucleotide sequences of any of the 
various figures herein. 

On the basis of amino acid sequence information 
oligonucleotide probes or primers may be designed, taking into 
account the degeneracy of the genetic code, and, where 
15 appropriate, codon usage of the organism from the candidate 
nucleic acid is derived. 

Preferably an oligonucleotide in accordance with certain 
embodiments of the invention, e.g. for use in nucleic acid 
amplification, is up to about 50 nucleotides, or about 40 
20 nucleotides or about 30 or fewer nucleotides in length (e.g. 
18, 21 or 24) . 

Assessment of whether or not such a PGR product 
corresponds to Mlo homologue genes may be conducted in various 
ways . A PGR band from such a reaction might contain a complex 
25 mix of products. Individual products may be cloned and each 
one individually screened. It may be analysed by 
transformation to assess function on introduction into a plant 
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of interest . 

As noted, nucleic acid according to the present invention 
is obtainable using oligonucleotides, designed on the basis of 
sequence information provided herein, as probes or primers. 
Nucleic acid isolated and/or purified from one or more cells of 
barley or another plant (see above) , or a nucleic acid library- 
derived from nucleic acid isolated and/or purified from the 
plant (e.g. a cDNA library derived from mRNA isolated from the 
plant) , may be probed under conditions for selective 
hybridisation and/or subjected to a specific nucleic acid 
amplification reaction such as the polymerase chain reaction 
(PGR) . The nucleic acid probed or used as template in the 
amplification reaction may be genomic DNA, cDNA or RNA. If 
necessary, one or more gene fragments may be ligated to 
generate a full-length coding sequence. 

We have tested several PGR primers derived from the Mlo 
sequence disclosed herein to test their specificity for 
amplifying nucleic acid according to the present invention, 
using both barley genomic DNA and RT-PCR templates. The latter 
was synthesized from barley polyA"" RNA. In each case we were 
able to amplify the expected Aflo derived gene fragments as 
shown by cloning and subsequent DNA sequencing of the PGR 
products. Full length cDNA clones can be obtained as described 
by 5' and 3' RACE technology if RT-PCR products are used as 
templates . 

Examples of primers tested include: 
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25L 5'-GTG CAT CTG CGT GTG CGT A-3' 

25LN 5' -GTG TGC GTA CCT GGT AGA G-3' 

25R 5'-AAC GAG GTC TGG TGC GTG- 3' 

3 3 5' -TGC AGC TAT ATG ACC TTC CCC CTC-3' 

37 5'-GGA CAT GCT GAT GGC TCA GA-3' 

38 5'-CAG AAC TTG TCT CAT CCC TG-3 
38A 5' -GGC TAT ACA TTG GGA CTA ACA-3' 
3 SB 5'-CGA ATC ATC ACA TCC TAT GTT-3' 

39 5'-GCA AGT TCG ACT TCC AC- 3' 

3 9A 5' -TCG ACT TCC ACA AGT ACA TCA- 3' 

53 5' -AGC GTA CCT GCG TAG GTA G-3' 



Various primer combinations have been tested: 
38/39A; 38/39; 38/33; 38/37; 38A/39A; 38B/39A; 38/25L; 38/25LN; 
25R/25L; 25R/25IiN; 25R/53 • 

Various aspects of the present invention include the 
obtainable nucleic acid, methods of screening material, e.g, 
cell lysate, nucleic acid preparations, for the presence of 
nucleic acid of interest, methods of obtaining the nucleic 
acid, and the primers and primer combinations given above . 

The sequence information provided herein also allows the 
design of diagnostic tests for determination of the presence of 
a specific mlo resistance allele, or a susceptibility allele 
(e.g. wild- type ) , in any given plant, cultivar, variety, 
population, landrace, part of a family or other selection in a 
breeding programme or other such genotype. A diagnostic test 
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may be based on determination of the presence or absence of a 
particular allele by means of nucleic acid or polypeptide 
determination. 

At the nucleic acid level, this may involve hybridisation 
5 of a suitable oligo- or poly-nucleotide , such as a fragment of 
the Mlo gene or a homologue thereof, including any homologue 
disclosed herein, or any particular allele, such as an allele 
which gives an mlo phenotype, such as any such allele disclosed 
herein. The hybridisation may involve PGR designed to amplify 

10 a product from a given allelic version of mlo, with subsequent 
detection of an amplified product by any of a number of 
possible methods including but not limited to gel 
electrophoresis, capillary electrophoresis, direct 
hybridisation of nucleotide sequence probes and so on. A 

15 diagnostic test may be based on PGR designed to amplify various 
alleles or any allele from the Mlo locus, with a test to 
distinguish the different possible alleles by any of a number 
of possible methods, including DNA fragment size, restriction 
site variation (e.g. CAPS - cleaved amplified polymorphic 

2 0 sites) and so on . A diagnostic test may also be based on a 

great number of possible variants of nucleic acid analysis that 
will be apparent to those skilled in the art, such as use of a 
synthetic jnlo-derived sequence as a hybridisation probe. 

Broadly, the methods divide into those screening for the 

25 presence of nucleic acid sequences and those that rely on 
detecting the presence or absence of a polypeptide. The 
methods may make use of biological samples from one or more 
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plants or cells that are suspected to contain the nucleic acid 
sequences or polypeptide. 

Exemplary approaches for detecting nucleic acid or 
polypeptides include analysing a sample from the plant or plant 
cell by: 

(a) comparing the sequence of nucleic acid in the sample 
with all or part of the nucleotide sequence shown in Figure 7 
to determine whether the sample from the patient contains a 
mutation; 

(b) determining the presence in the sample of a 
polypeptide including the amino acid sequence shown in Figure 2 
or a fragment thereof and, if present, determining whether the 
polypeptide is full length, and/or is mutated, and/or is 
expressed at the normal level; 

(c) performing DNA fingerprinting to compare the 
restriction pattern produced when a restriction enzyme cuts 
nucleic acid in the sample with the restriction pattern 
obtained from the nucleotide sequence shown in Figure 7 or from 
a known mutant, allele or variant thereof; 

(d) contacting the sample with a specific binding member 
capable of binding to nucleic acid including the nucleotide 
sequence as set out in Figure 7 or a fragment thereof, or a 
mutant, allele or variant thereof, the specific binding member 
including nucleic acid hybridisable with the sequence of Figure 
7 or a polypeptide including a binding domain with specificity 
for nucleic acid including the sequence of Figure 7 or the 
polypeptide encoded by it, or a mutated form thereof, and 



wo 98/04586 



PCT/GB97/02046 



22 

determining binding of the specific binding member; 

(e) performing PGR involving one or more primers based on 
the nucleotide sequence shown in Figure 7 to screen the sample 
for nucleic acid including the nucleotide sequence of Figure 7 
5 or a mutant, allele or variant thereof. 

When screening for a resistance allele nucleic acid, the 
nucleic acid in the sample will initially be amplified, e.g. 
using PGR, to increase the amount of the analyte as compared to 
other sequences present in the sample . This allows the target 

10 sequences to be detected with a high degree of sensitivity if 
they are present in the sample. This initial step may be 
avoided by using highly sensitive array techniques that are 
becoming increasingly important in the art . 

A variant form of the gene may contain one or more 

15 insertions, deletions, substitutions and/or additions of one or 
more nucleotides compared with the wild- type sequence (such as 
shown in Table 1) which may or may not disrupt the gene 
function. Differences at the nucleic acid level are not 
necessarily reflected by a difference in the amino acid 

20 sequence of the encoded polypeptide. However, a mutation or 

other difference in a gene may result in a frame -shift or stop 
codon, which could seriously affect the nature of the 
polypeptide produced (if any) , or a point mutation or gross 
mutational change to the encoded polypeptide, including 

25 insertion, deletion, substitution and/or addition of one or 

more amino acids or regions in the polypeptide. A mutation in 
a promoter sequence or other regulatory region may prevent or 
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reduce expression from the gene or affect the processing or 
stability of the mRNA transcript. 

Tests may be carried out on preparations containing 
genomic DNA, cDNA and/or mRNA. Testing cDNA or mRNA has the 
advantage of the complexity of the nucleic acid being reduced 
by the absence of intron sequences, but the possible 
disadvantage of extra time and effort being required in making 
the preparations. RNA is more difficult to manipulate than DNA 
because of the wide-spread occurrence of RN'ases. 

Nucleic acid in a test sample may be sequenced and the 
sequence compared with the sequence shown in Figure 2, or other 
figure herein, to determine whether or not a difference is 
present. If so, the difference can be compared with known 
susceptibility alleles (e.g. as summarised in Table 1) to 
determine whether the test nucleic acid contains one or more of 
the variations indicated, or the difference can be investigated 
for association- with disease resistance . 

The amplified nucleic acid may then be sequenced as above, 
and/or tested in any other way to determine the presence or 
absence of a particular feature. Nucleic acid for testing may 
be prepared from nucleic acid removed from cells or in a 
library using a variety of other techniques such as restriction 
enzyme digest and electrophoresis. 

Nucleic acid may be screened using a variant- or allele- 
specific probe. Such a probe corresponds in sequence to a 
region of the gene, or its complement, containing a sequence 
alteration known to be associated with disease resistance. 
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Under suitably stringent conditions, specific hybridisation of 
such a probe to test nucleic acid is indicative of the presence 
of the sequence alteration in the test nucleic acid. For 
efficient screening purposes, more than one probe may be used 
5 on the same test sample . 

Allele- or variant -specific oligonucleotides may similarly 
be used in PGR to specifically amplify particular sequences if 
present in a test sample. Assessment of whether a PGR band 
contains a gene variant may be carried out in a number of ways 

10 familiar to those skilled in the art. The PGR product may for 
instance be treated in a way that enables one to display the 
mutation or polymorphism on a denaturing polyacrylamide DNA 
sequencing gel, with specific bands that are linked to the gene 
variants being selected. 

15 An alternative or supplement to looking for the presence 

of variant sequences in a test sample is to look for the 
presence of the normal sequence, e.g. using a suitably specific 
oligonucleotide probe or primer. 

Approaches which rely on hybridisation between a probe and 

20 test nucleic acid and subsequent detection of a mismatch may be 
employed. Under appropriate conditions (temperature, pH etc.), 
an oligonucleotide probe will hybridise with a sequence which 
is not entirely complementary. The degree of base-pairing 
between the two molecules will be sufficient for them to anneal 

25 despite a mis-match. Various approaches are well known in the 
art for detecting the presence of a mis-match between two 
annealing nucleic acid molecules. 
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For instance. RN'ase A cleaves at the site of a mis-match. 
Cleavage can be detected by electrophoresing test nucleic acid 
to which the relevant probe or probe has annealed and looking 
for smaller molecules (i.e. molecules with higher 
electrophoretic mobility) than the full length probe/test 
hybrid. Other approaches rely on the use of enzymes such as 
resolvases or endonucleases . 

Thus, an oligonucleotide probe that has the sequence of a 
region of the normal gene (either sense or anti-sense strand) 
in which mutations associated with disease resistance are Icnown 
to occur (e.g. see Table 1) may be annealed to test nucleic 
acid and the presence or absence of a mis -match determined. 
Detection of the presence of a mis -match may indicate the 
presence in the test nucleic acid of a mutation associated with 
disease resistance. On the other hand, an oligonucleotide 
probe that has the sequence of a region of the gene including a 
mutation associated with disease resistance may be annealed tp 
test nucleic acid and the presence or absence of a mis -match 
determined. The presence of a mis -match may indicate that the 
nucleic acid in the test sample has the normal sequence, or a 
different mutant or allele sequence. In either case, a battery 
of probes to different regions of the gene may be employed. 

The presence of differences in sequence of nucleic acid 
molecules may be detected by means of restriction enzyme 
digestion, such as in a method of DNA fingerprinting where the 
restriction pattern produced when one or more restriction 
enzymes are used to cut a sample of nucleic acid is compared 
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with the pattern obtained when a sample containing the normal 
gene or a variant or allele is digested with the same enzyme or 
enzymes . 

The presence of absence of a lesion in a promoter or other 
5 regulatory sequence may also be assessed by determining the 
level of mRNA production by transcription or the level of 
polypeptide production by translation from the mRNA. 

Nucleic acid isolated and/or purified from one or more 
cells of a plant or a nucleic acid library derived from nucleic 

10 acid isolated and/or purified from cells (e.g. a cDNA library 

derived from mRNA isolated from the cells) , may be probed under 
conditions for selective hybridisation and/or subjected to a 
specific nucleic acid amplification reaction such as the 
polymerase chain reaction (PCR) . 

15 A method may include hybridisation of one or more (e.g. 

two) probes or primers to target nucleic acid. Where the 
nucleic acid is double -stranded DNA, hybridisation will 
generally be preceded by denaturation to produce single- 
stranded DNA. The hybridisation may be as part of a PCR 

20 procedure, or as part of a probing procedure not involving PCR. 
An example procedure would be a combination of PCR and low 
stringency hybridisation. A screening procedure, chosen from 
the many available to those skilled in the art, is used to 
identify successful hybridisation events and isolate hybridised 

25 nucleic acid. 

Binding of a probe to target nucleic acid (e.g. DNA) may 
be measured using any of a variety of techniques at the 
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disposal of those skilled in the art. For instance, probes may 
be radioactively, fluorescent ly or enzymatically labelled. 
Other methods not employing labelling of probe include 
examination of restriction fragment length polymorphisms, 
amplification using PGR, RNAase cleavage and allele specific 
oligonucleotide probing. 

Probing may employ the standard Southern blotting 
technique. For instance DNA may be extracted from cells and 
digested with different restriction enzymes. Restriction 
fragments may then be separated by electrophoresis on an 
agarose gel, before denaturation and transfer to a 
nitrocellulose filter. Labelled probe may be hybridised to the 
DNA fragments on the filter and binding determined. DNA for 
probing may be prepared from RNA preparations from cells - 

Preliminary experiments may be performed by hybridising 
under low stringency conditions various probes to Southern 
blots of DNA digested with restriction enzymes. Suitable 
conditions would be achieved when a large number of hybridising 
fragments were obtained while the background hybridisation was 
low. Using these conditions nucleic acid libraries, e.g. cDNA 
libraries representative of expressed sequences, may be 
searched . 

As noted, those skilled in the art are well able to employ 
suitable conditions of the desired stringency for selective 
hybridisation, taking into account factors such as 
oligonucleotide length and base composition, temperature and so 
on . 
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In some preferred embodiments of diagnostic assays 
according to the present invention, oligonucleotides according 
to the present invention that are fragments of any of the 
sequences shown in Figure 2, or any allele associated with 
disease resistance, e.g. as identified in Table 1, are at least 
about 10 nucleotides in length, more preferably at least about 
15 nucleotides in length, more preferably at least about 20 
nucleotides in length, more preferably about 3 0 nucleotides in 
length. Such fragments themselves individually represent 
aspects of the present invention. Fragments and other 
oligonucleotides may be used as primers or probes as discussed 
but may also be generated (e.g. by PGR) in methods concerned 
with determining the presence in a test sample of a sec[uence 
indicative of disease resistance. 

There are various methods for determining the presence or 
absence in a test sample of a particular polypeptide, such as 
the polypeptide with the amino acid sequence shown in Figure 2, 
or other figure herein, or an amino acid sequence mutant, 
variant or allele thereof (e.g. including an alteration shown 
in Table 1) . 

A sample may be tested for the presence of a binding 
partner for a specific binding member such as an antibody (or 
mixture of antibodies) , specific for one or more particular 
variants of the polypeptide shown in Figure 2, e.g. see Table 
1 . 

In such cases, the sample may be tested by being contacted 
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with a specific binding member such as an antibody under 
appropriate conditions for specific binding, before binding is 
determined, for instance using a reporter system as discussed. 
Where a panel of antibodies is used, different reporting labels 
may be employed for each antibody so that binding of each can 
be determined. 

A specific binding member such as an antibody may be used 
to isolate and/or purify its binding partner polypeptide from a 
test sample, to allow for sequence and/or biochemical analysis 
of the polypeptide to determine whether it has the sequence 
and/or properties of the wild-type polypeptide or a particular 
mutant, variant or allele thereof. Amino acid sequence is 
routine in the art using automated sequencing machines. 

The use of diagnostic tests for mlo alleles allows the 
researcher or plant breeder to establish, with full confidence 
and independent from time consuming resistance tests, whether 
or not a desired allele is present in the plant of interest (or 
a cell thereof) , whether the plant is a representative of a 
collection of other genetically identical plants (e.g. an 
inbred variety or cultivar) or one individual in a sample of 
related (e.g. breeders' selection) or unrelated plants. The 
mlo alleles conferring the desirable disease resistance 
phenotype are recessive, and are not therefore detectable at 
the whole plant phenotype level when in a heterozygous 
condition in the presence of a wild-type HHo allele. 
Phenotypic screening for the presence of such recessive alleles 
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is therefore only possible on material homozygous for the mlo 
locus and so delays substantially the generation in a plant 
breeding programme at which selection can be reliably and cost- 
effectively applied. In a backcross breeding programme where « 
5 for example, a breeder is aiming to introgress a desirable mlo 
allele into an elite adapted high performing target genotype, 
the mio locus will be permanently in the heterozygous condition 
until selfing is carried out. Nucleic acid or polypeptide 
testing for the presence of the recessive allele avoids the 

10 need to test self ed progeny of backcross generation 

individuals, thus saving considerable time and money. In other 
types of breeding scheme based on selection and selfing of 
desirable individuals, nucleic acid or polypeptide diagnostics 
for the desirable mlo alles in high throughput, low cost assays 

15 as provided by this invention, reliable selection for the 

desirable mlo alleles can be made at early generations and on 
more material than would otherwise be possible. This gain in 
reliability of selection plus the time saving by being able to 
test material earlier and without costly resistance phenotype 

20 screening is of considerable value in plant breeding. 

By way of example for nucleic acid testing, the barley 
jnio-5 resistance allele is characterized by a G- to A- 
nucleotide substitution in the predicted start codon of the Mlo 
gene (Table l) . The mutation may easily be detected by 

25 standard PGR amplification of a Mlo gene segment from genomic 
template DNA with the primers: 

forward primer : 5 ' -GTTGCCACACTTTGCCACG- 3 ' 
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reverse primer : 5 ' - AAGCCAAGACGACAATCAGA- 3 ' 

(for example) , followed by digestion witht he restriction 
enzyme PshAl . This generates a cleaved amplified polymorphic 
sequences (CAPS) marker which may be displayed using 
conventional agarose gel electrophoresis. Presence of a 769 bp 
fragment is indicative of the presence of the allele. 

The mlO'9 resistance allele is characterized by a C- to T- 
nucleotide substitution (Table 1) . This allele is of 
particular relevance since it is used frequently in breeding 
material. The mutational event may be easily detected using 
the primers : 

forward primer 5 ' -GRRGCCACACTTTGCCACG-3 ' 
reverse primer 5' -AAGCCAAGACGACAATCAGA-3 ' 
(for example) and subsequent digestion of genomic amplification 
products with the restriction enzyme Hhal - This generates a 
CAPS marker which may be displayed by conventional agarose gel 
electrophoresis. The presence of a 374 bp fragment is 
indicative of the presence of mlo-S. 

A third, particularly interesting allele is mlo-12, 
characterised by a substitution a residue 24 0, specifically a 
Phe24 0 to leucine replacement- This may result from a C720 to 
A substitution in the encoding nucleotide sequence (Table 1) . 
This is the only currently documented mla allele for which 
conclusive evidence is available that the altered protein 
retains residual wild-type activity (Hentrich, 1979, Arch. 
Zuchtun^svorsch. , Berlin 3, B. 283-291). mlo-12 exhibits no 
detectable spontaneous cell death reaction but confers a 
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sufficient level of resistance to pathogens such as the powdery 
mildew fungus. nilO'-12 may therefore be the allele of choice in 
breeding programs if minimal pleiotropic effects (spontaneous 
cell death) are desirable after introgression of the mlo 
resistance in elite breeding lines. Furthermore, the molecular 
site of the amino acid substitution within the Mlo protein 
allows the design of alleles with a residual wild-type 
activity, and also the obtention of interacting and/or 
inhibitory molecules, reducing undesirable pleiotropic effects 
from a complete loss of function of the Mlo protein. 

Nucleic acid-based determination of the presence or 
absence of mla alleles may be combined with determination of 
the genotype of the flanking linked genomic DNA and other 
unlinked genomic DNA using established sets of markers such as 
RFLPs, microsatellites or SSRs, AFLPs, RAPDs etc. This enables 
the researcher or plant breeder to select for not only the 
presence of the desirable mlo allele but also for individual 
plant or families of plants which have the most desirable 
combinations of linked and unlinked genetic background. Such 
recombinations of desirable material may occur only rarely 
within a given segregating breeding population or backcross 
progeny. Direct assay of the mla locus as afforded by the 
present invention allows the researcher to make a stepwise 
approach to fixing (making homozygous) the desired combination 
of flanking markers and mlo alleles, by first identifying 
individuals fixed for one flanking marker and then identifying 
progeny fixed on the other side of the mlo locus all the time 
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knowing with confidence that the desirable mlo allele is still 
present . 

The present disclosure provides sufficient information for 
a person skilled in the art to obtain genomic DNA sequence for 
any given new or existing mlo allele and devise a suitable 
nucleic acid- and/or polypeptide -based diagnostic assay. 
Existing mlo alleles to which this may be applied include, for 
example, mlo-l, mlo- 3, mIo-4 , mlo- 5, mlo- 6, mlo- 7. mlo- 8, mlo- 
9, mlo-10. mlo-12, mIo-13, m2o-16, mIo-17. mIo-26 and mlo-28. 
for all of which sequence information is provided herein (see 
e.g. Figure. 2 and Table 1). In designing a nucleic acid assay 
account is taken of the distinctive variation in sequence that 
characterises the particular variant allele. Thus, the present 
invention extends to an oligonucleotide fragment of a mlo 
allele, having a sequence which allows it to hybridise 
specifically to that allele as compared with other mlo alleles. 
Such an oligonucleotide spans a nucleotide at which a mlo 
mutation occurs, and may include the mutated nucleotide at or 
towards its 3' or 5' end. Such an oligonucleotide may 
hybridise with the sense or anti-sense strand. The variation 
may be within the coding sequence of the mlc gene, or may lie 
within an intron sequence or in an upstream or downstream non- 
coding sequence, wherein disruption affects or is otherwise 
related to the lesion in Mlo that results in the mildew 
resistant phenotype. 

The mlo- 9 allele is widely but not exclusively used in 
plant breeding (J Helms Jorgensen - Euphytica (1992) 63: 141- 
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breeding has largely been restricted to spring barley, because 
the spontaneous cell death response associated with many of the 
mutant alleles appears to represent a penalty to plant growth 
5 and performance when incorporated into high yielding winter 
barley genotypes. However different mlo alleles have 
different degrees of associated spontaneous cell death 
response, and thus some, either existing or newly created from 
mutagenesis programmes or isolated as spontaneous mutants, are 

10 more suitable than others for incorporation into winter barley 
backgrounds* The jnlo-12 allele may be particularly suitable 
since no detectable pleiotropic effects occur despite 
conferring a sufficient level of pathogen resistance. The use 
of mlo based mildew resistance more widely in winter barleys 

15 will have significant value for barley growers as well as 

significant economic and environmental implications such as 
reduced use of fungicide inputs with their associated treatment 
costs. The provision of nucleic acid diagnostics as provided 
herein enables rapid and accurate deployment of new and 

20 existing mlo alleles into winter barley germplasm. 

Plants which include a plant cell according to the 
invention are also provided, along with any part or propac^ule 
thereof, seed, selfed or hybrid progeny and descendants. A 
25 plant according to the present invention may be one which does 
not breed true in one or more properties . Plant varieties may 
be excluded, particularly registrable plant varieties according 
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to Plant Breeders' Rights. It is noted that a plant need not 
be considered a "plant variety" simply because it contains 
stably within its genome a transgene, introduced into a cell of 
the plant or an ancestor thereof . 

In addition to a plant, the present invention provides any 
clone of such a plant, seed, selfed or hybrid progeny and 
descendants, and any part of any of these, such as cuttings, 
seed. The invention provides any plant propagule, that is any 
part which may be used in reproduction or propagation, sexual 
or asexual, including cuttings, seed and so on. Also 
encompassed by the invention is a plant which is a sexually or 
asexually propagated off -spring, clone or descendant of such a 
plant, or any part or propagule of said plant, off -spring, 
clone or descendant. 

A further aspect of the present invention provides a 
method of making a plant cell involving introduction of the 
sequence (e.g. as part of a suitable vector) into a plant cell 
and causing or allowing recombination between the vector and 
the plant cell genome to introduce the sequence of nucleotides 
into the genome . 

Following transformation of a plant cell a plant may be 

regenerated . 

The invention further provides a method of modulating Mlo 
expression in a plant, which may modulate a defence response in 
the plant, comprising expression of a heterologous Mlo gene 
sequence (or mutant, allele, variant or homologue thereof, as 
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discussed) within cells of the plant. As discussed further 
herein, modulation or alteration of the level of constitutive 
defence response in a plant may be by way of suppression, 
repression or reduction (in the manner of wild- type Mlo) or 
promotion, stimulation, activation, increase, enhancement or 
augmentation (in the manner of mutant mlo) . Activation or 
enhancement of the defence response may confer or increase 
pathogen resistance of the plant, especially resistance to 
powdery mildew and/or rust (such as yellow rust) . 

The term "heterologous" may be used to indicate that the 
gene/sequence of nucleotides in question have been introduced 
into said cells of the plant or an ancestor thereof, using 
genetic engineering, ie by human intearvention . A transgenic 
plant cell, i.e. transgenic for the nucleic acid in question, 
may be provided. The transgene may be on an extra-genomic 
vector or incorporated, preferably stably, into the genome. A 
heterologous gene may replace an endogenous equivalent gene, ie 
one which normally performs the same or a similar function, or 
the inserted sequence may be additional to the endogenous gene 
or other sequence. An advantage of introduction of a 
heterologous gene is the ability to place expression of a 
sequence under the control of a promoter of choice, in order to 
be able to influence expression according to preference, such 
as under particular developmental, spatial or temporal control, 
or under control of an inducible promoter. Furthermore, 
mutants, variants and derivatives of the wild-type gene, e.g. 
with higher or lower activity than wild- type, may be used in 
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place of the endogenous gene. Nucleic acid heterologous, or 
exogenous or foreign, to a plant cell may be non-naturally 
occuring in cells of chat type, variety or species. Thus, 
nucleic acid may include a coding sequence of or derived from a 
particular type of plant cell or species or variety of plant, 
placed within the context of a plant cell of a different type 
or species or variety of plant. A further possibility is for a 
nucleic acid sequence to be placed within a cell in which it or 
a homologue is found naturally, but wherein the nucleic acid 
sequence is linked and/or adjacent to nucleic acid which does 
not occur naturally within the cell, or cells of that type or 
species or variety of plant, such as operably linked to one or 
more regulatory sequences, such as a promoter sequence, for 
control of expression. A sequence within a plant or other host 
cell may be identifiably heterologous, exogenous or foreign. 

Down-regulation of wild- type Mlo gene function leads to 
stimulation of a constitutive defence response. This may be 
achieved in a number of different ways, as illustrated below. 

The nucleic acid according to the invention may be placed 
under the control of an inducible gene promoter thus placing 
expression under the control of the user. 

In a further aspect the present invention provides a gene 
construct comprising an inducible promoter operatively linked 
to a nucleotide sequence provided by the present invention. As 
discussed, this enables control of expression of the gene. The 
invention also provides plants transformed with said gene 
construct and methods comprising introduction of such a 
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construct into a plant cell and/or induction of expression of a 
construct within a plant cell, e.g. by application of a 
suitable stimulus, such as an effective exogenous inducer or 
endogenous signal . 
5 The term "inducible" as applied to a promoter is well 

understood by those skilled in the art. In essence, expression 
under the control of an inducible promoter is "switched on" or 
increased in response to an applied stimulus (which may be 
generated within a cell or provided exogenously) . The nature of 

10 the stimulus varies between promoters. Some inducible promoters 
cause little or undetectable levels of expression (or no 
expression) in the absence of the appropriate stimulus. Other 
inducible promoters cause detectable constitutive expression in 
the absence of the stimulus. Whatever the level of expression 

15 is in the absence of the stimulus, expression from any 

inducible promoter is increased in the presence of the correct 
stimulus. The preferable situation is where the level of 
expression increases upon application of the relevant stimulus 
by an amount effective to alter a phenotypic characteristic. 

20 Thus an inducible (or "switchable" ) promoter may be used which 
causes a basic level of expression in the absence of the 
stimulus which level is too low to bring about a desired 
phenotype (and may in fact be zero) . Upon application of the 
stimulus, expression is increased (or switched on) to a level 

2 5 which brings about the desired phenotype. 

Suitable promoters include the Cauliflower Mosaic Virus 
35S (CaMV 35S) gene promoter that is expressed at a high level 
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in virtually all plant tissues (Benfey et al, (1990a) EMBO J 9: 
1677-1684); the cauliflower meri 5 promoter that is expressed 
in the vegetative apical meristem as well as several well 
localised positions in the plant body, eg inner phloem, flower 
primordia, branching points in root and shoot (Medford, J.I. 
(1992) Plant Cell 4/ 1029-1039; Medford et al, (1991) Plant 
Cell 3, 359-370) and the Arabldopsie thalxana LEAFY promoter 
that is expressed very early in flower development (Weigel et 
al, (1992) Cell SB, 843-859). 

An aspect of the present invention is the use of nucleic 
acid according to the invention in the production of a 
transgenic plant. 

When introducing a chosen gene construct into a cell, 
certain considerations must be taken into account, well known 
to those skilled in the art. The nucleic acid to be inserted 
should be assembled within a construct which contains effective 
regulatory elements which will drive transcription. There must 
be available a method of transporting the construct into the 
cell. Once the construct is within the cell membrane, 
integration into the endogenous chromosomal material either 
will or will not occur. Finally, as far as plants are concerned 
the target cell type must be such that cells can be regenerated 
into whole plants. 

Plants transformed with the DNA segment containing the 
sequence may be produced by standard techniques which are 
already known for the genetic manipulation of plants. DNA can 
be transformed into plant cells using any suitable technology. 
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such as a disarmed Ti-plasmid vector carried by Agrobacterium 
exploiting its natural gene transfer ability (EP-A-270355 , EP- 
A-0116718, NAR 12(22) 8711 - 87215 1984), particle or 
microprojectile bombardment (US 5100792, EP-A-444882, EP-A- 
5 434616) microinjection (WO 92/09696, WO 94/00583, EP 331083, EP 
175966, Green et al . (1987) Plant Tissue and Cell Culture, 
Academic Press), electroporation (EP 290395, WO 8706614) other 
forms of direct DNA uptake (DE 4005152, WO 9012096, US 
4684611) , liposome mediated DNA uptake (e.g. Freeman et al . 

10 Plant Cell Physiol. 29: 1353 (1984)), or the vortexing method 

(e.g. Kindle, PNAS U.S.A. 87: 1228 (1990d) Physical methods for 
the transformation of plant cells are reviewed in Oard, 1991, 
Biotech. Adv. 9: l-ll. 

Agrobacterium transformation is widely used by those 

15 skilled in the art to transform dicotyledonous species. 

Recently, there has been substantial progress towards the 
routine production of stable, fertile transgenic plants in 
almost all economically relevant monocot plants (Toriyama, et 
al. (1988) Bio/Technoloffy 6, 1072-1074; Zhang, et ai . (1988) 

20 Plant Cell Rep. 7, 379-384; Zhang, et al . (1988) Theor Appl 
Genet IG, 835-840; Shimamoto, et al . (1989) Nature 338, 274- 
276; Datta, et al . (1990) Bio /Technology 8, 736-740; Christou, 
et al. (1991) Bio /Technology 9, 957-962; Peng, et al . (1991) 
International Rice Research Institute, Manila, Philippines 563- 

25 574; Cao, et al . (1992) Plant Cell J^ep. 11, 585-591; Li, et al . 
(1993) Plant Cell Rep. 12, 250-255; Rathore, et al . (1993) 
Plant Molecular Biology 21, 871-884; Fromm, et al . (1990) 
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Bio/Technology B. 833-839; Gordon - Kamm , et al . (1990) Plant 
Cell 2, 603-618; D'Halluin, et al . (1992) Plant Cell 4, 1495- 
1505; Walters, et al . (1992) Plant Molecular Biologry 18, 189- 
200; Koziel, et al . (1993) Biotechnology 11, 194-200; Vasil, I. 
K. (1994) Plant Molecular Biology 25, 925-937; Weeks, et al . 
(1993) Plajit Physiology 102, 1077-1084; Somers, et al . (1992) 
Bio/Technology ±0. 1589-1594; W092/14828) . In particular, 
Agrobacterium mediated transformation is now emerging also as 
an highly efficient alternative transformation method in 
monocots (Hiei et al . (1994) The Plant Journal 6, 271-282). 

The generation of fertile transgenic plants has been 
achieved in the cereals rice, maize, wheat, oat, and barley 
(reviewed in Shimamoto, K. (1994) Current Opinion in 
Biotechnology 5, 158-162.; Vasil, et al . (1992) Bio/Technology 
10, 667-674; Vain et al . , 1995, Biotechnology Advances 13 (4): 
653-671; Vasil, 1996, Nature Biotechnology 14 page 702). 

Microprojectile bombardment, electroporation and direct 
DNA upta)ce are preferred where Agrobacterium is inefficient or 
ineffective. Alternatively, a combination of different 
techniques may be employed to enhance the efficiency of the 
transformation process, eg bombardment with Agrobacterium 
coated microparticles (EP-A-486234) or microprojectile 
bombardment to induce wounding followed by co- cultivation with 
Agrobacterium (EP-A-4 86233) . 

Following transformation, a plant may be regenerated, e.g. 
from single cells, callus tissue or leaf discs, as is standard 
in the art. Almost any plant can be entirely regenerated from 
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cells, tissues and organs of the plant. Available techniques 
are reviewd in Vasil et al., Cell Culture and Somatic Cel 
Genetics of Plants, Vol I, II and III, Lahoratory Procedures 
and Their Applications , Academic Press, 1984, and Weissbach and 
Weissbach, Methods for Plant Molecular Biology, Academic Press, 
1989 . 

The particular choice of a transformation technology will 
be determined by its efficiency to transform certain plant 
species as well as the experience and preference of the person 
practising the invention with a particular methodology of 
choice. It will be apparent to the skilled person that the 
particular choice of a transformation system to introduce 
nucleic acid into plant cells is not essential to or a 
limitation of the invention, nor is the choice of technique for 
plant regeneration. 

In the present invention, expression may be achieved by 
introduction of the nucleotide sequence in a sense orientation. 
Thus, the present invention provides a method of modulation of 
a defence response in a plant, the method comprising causing or 
allowing expression of nucleic acid according to the invention 
within cells of the plant. Generally, it will be desirable to 
stimulate the defence response, and this may be achieved by 
disrupting Mlo gene function. 

Down- regulation of expression of a target gene may be 
achieved using ant i- sense technology or "sense regulation" 
("co-suppression") . 

In using anti-sense genes or partial gene sequences to 
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down-regulate gene expression, a nucleotide sequence is placed 
under the control of a prornoter in a "reverse orientation" such 
that transcription yields RNA which is complementary to normal 
mRNA transcribed from the "sense" strand of the target gene. 
See, for example, Rothstein et al, 1987; Smith et al, (1988) 
Nature 334, 724-726; Zhang et al , (1992) The Plant Cell 4, 1575- 
1588, English et aJ - , (1996) The Plant Cell 8, 179-188. 
Antisense technology is also reviewed in Bourque, (1995) , Plant 
Science 105, 125-149, and Flavell, (1994) PNAS USA 91, 3490- 
3496. 

An alternative is to use a copy of all or part of the 
target gene inserted in sense, that is the same, orientation as 
the target gene, to achieve reduction in expression of the 
target gene by co- suppress ion. See, for example, van der Krol 
et al., (1990) The Plant Cell 2, 291-299; Napoli et al . , (1990) 
The Plant Cell 2, 279-289; Zhang et al . , (1992) The Plant Cell 
4, 1575-1588, and US-A- 5 , 231 , 020 . 

The complete sequence corresponding to the coding sequence 
(in reverse orientation for anti -sense) need not be used. For 
example fragments of sufficient length may be used. It is a 
routine matter for the person skilled in the art to screen 
fragments of various sizes and from various parts of the coding 
sequence to optimise the level of anti-sense inhibition.. It 
may be advantageous to include the initiating methionine ATG 
codon, and perhaps one or more nucleotides upstream of the 
initiating codon. A further possibility is to target a 
conserved sequence of a gene, e.g. a sequence that is 
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characteristic of one or more genes, such as a regulatory 
sequence. Antisense constructs may involve 3 ' end or 5 'end 
sequences of Mia or homologues. In cases where several Mlo 
homologues exist in a plant species, the involvement of* 5'- and 
5 3 '-end untranslated sequences in the construct will enhance 
specificity of silencing. 

The sequence employed may be about 500 nucleotides or 
less, possibly about 400 nucleotides, about 300 nucleotides, 
about 200 nucleotides, or about 100 nucleotides. It may be 

10 possible to use oligonucleotides of much shorter lengths, 14-23 
nucleotides, although longer fragments, and generally even 
longer than about 500 nucleotides are preferable where 
possible, such as longer than about 600 nucleotides, than about 
700 nucleotides, than about 800 nucleotides, than about 1000 

15 nucleotides, than about 1200 nucleotides, than about 1400 
nucleotides, or more. 

It may be preferable that there is complete sequence 
identity in the sequence used for down -regulation of expression 
of a target sequence, and the target sequence, though total 

20 complementarity or similarity of sequence is not essential. 
One or more nucleotides may differ in the sequence used from 
the target gene. Thus, a sequence employed in a down- 
regulation of gene expression in accordance with the present 
invention may be a wild- type sequence (e.g. gene) selected from 

25 those available, or a mutant, derivative, variant or allele, by 
way of insertion, addition, deletion or substitution of one or 
more nucleotides, of such a sequence. The sequence need not 
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include an open reading frame or specify an RNA that would be 
translatable. It may be preferred for there to be sufficient 
homology for the respective anti-sense and sense RNA molecules 
to hybridise. There may be down regulation of gene expression 
even where there is about 5%, 10%, 15% or 20% or more mismatch 
between the sequence used and the target gene. 

Generally, the transcribed nucleic acid may represent a 
fragment of an Mlo gene, such as including a nucleotide 
sequence shown in Figure 2, or the complement thereof, or may 
be a mutant, derivative, variant or allele thereof, in similar 
terms as discussed above in relation to alterations being made 
to a coding sequence and the homology of the altered sequence. 
The homology may be sufficient for the transcribed anti-sense 
RNA to hybridise with nucleic acid within cells of the plant, 
though irrespective of whether hybridisation takes place the 
desired effect is down -^regulation of gene expression. 

Anti-sense regulation may itself be regulated by employing 
an inducible promoter in an appropriate construct. 

Constructs may be expressed using the natural promoter, by 
a constitutively expressed promoter such as the CaMV 35S 
promotor, by a tissue-specific or cell-type specific promoter, 
or by a promoter that can be activated by an external signal or 
agent- The CaMV 35S promoter but also the rice actinl and 
maize ubiquitin promoters have been shown to give high levels 
of reporter gene expression in rice (Fujimoto et al . , (1993) 
Bio/Technology 11, 1151-1155; Zhang, etal., (1991) Plant Cell 
3, 1155-1165; Cornejo et al . , (1993) Plant Molecular Biology 
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23, 567-581). 

For use in anti-sense regulation, nucleic acid including a 
nucleotide sequence complementary to a coding sequence of a Mlo 
gene (i.e. including homologues) , or a fragment of a said 
5 coding sequence suitable for use in anti- sense regulation of 
expression, is provided. This may be DNA and under control of 
an appropriate regulatory sequence for anti-sense transcription 
in cells of interest. 

Thus, the present invention also provides a method of 
10 conferring pathogen resistance on a plant, the method including 
causing or allowing anti-sense transcription from heterologous 
nucleic acid according to the invention within cells of the 
plant . 

The present invention further provides the use of the 
15 nucleotide sequence of Figure 2 or a fragment, mutant, 

derivative, allele, variant or homologue thereof, such as any 
sequence shown or identified herein, for down- regulation of 
gene expression, particularly down-regulation of expression of 
an Mlo gene or homologue thereof, preferably in order to confer 
20 pathogen resistance on a plant. 

When additional copies of the target gene are inserted in 
sense, that is the same, orientation as the target gene, a 
range of phenotypes is produced which includes individuals 
where over -expression occurs and some where under -express ion of 
25 protein from the target gene occurs. When the inserted gene is 
only part of the endogenous gene the number of under-expressing 
individuals in the transgenic population increases. The 
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mechanism by which sense regulation occurs, particularly 
down -regulation, is not well -understood. However, this 
technique is well-reported in scientific and patent literature 
and is used routinely for gene control. See, for example, van 
der Krol et al . , (1990) The Plant Cell 2, 291-229; Napoli et 
al., (1990) The Plant Cell 2, 279-289; Zhang et al, 1992 The 
Plant. Cell 4, 1575-1588. 

Again, fragments, mutants and so on may be used in similar 
terms as described above for use in anti-sense regulation. 

Thus, the present invention also provides a method of 
conferring pathogen resistance on a plant, the method including 
causing or allowing expression from nucleic acid according to 
the invention within cells of the plant. This may be used to 
suppress Mlo activity. Here the activity of the product is 
preferably suppressed as a result of under-expression within 

the plant cells. 

As noted, Mlo down- regulation may promote activation of a 
defence response, which may in turn confer or augment pathogen 
resistance of the plant, especially resistance to powdery 
mildew and/or rust (e.g. yellow rust) . 

Thus, the present invention also provides a method of 
modulating Mlo function in a plant, the method comprising 
causing or allowing expression from nucleic acid according to 
the invention within cells of the plant to suppress endogenous 

Mlo expression. 

Modified versions of Mlo may be used to down-regulate 
endogenous Mlo function. For example mutants, variants. 
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derivatives etc. , may be employed. For instance, expression of 
a mlo mutant sequence at a high level may out -compete activity 
of endogenous Mlo, 

Reduction of Aflo wild type activity may be achieved by 
using ribozymes, such as replication ribozymes, e.g. of the 
hammerhead class (Haseloff and Gerlach, 1988, Nature 334: 585- 
591; Feyter et al . Mol . , 1996, Gen. Genet. 250: 329-338). 

Another way to reduce Mlo function in a plant employs 
transposon mutagenesis (reviewed by Osborne et al . , (1995) 
Current Opinion in Cell Biology 7, 406-413) . Inactivation of 
genes has been demonstrated via a * targeted tagging' approach 
using either endogenous mobile elements or heterologous cloned 
transposons which retain their mobility in alien genomes. Mlo 
alleles carrying any insertion of known sequence could be 
15 identified by using PGR primers with binding specificities both 
in the insertion sequence and the Mlo homologue . *Two-element 
systems' could be used to stabilize the transposon within 
inactivated alleles. In the two-element approach, a T-DNA is 
constructed bearing a non- autonomous transposon containing 
20 selectable or screenable marker gene inserted into an excision 
marker. Plants bearing these T-DNAs are crossed to plants 
bearing a second T-DNA expressing transposase function. Hybrids 
are double-selected for excision and for the marker within the 
transposon yielding plants with transposed elements. The 
25 two-element approach has a particular advantage with respect to 
Ac/Ds of maize, as the transposed Ds is likely to be unlinked 
to the transposase, facilitating outcrossing and stabilization 
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Of the Ds insertion (aones et al..(1994) Science 266, 789-793; 
Osborne et ai . , (1995) Curorent Opinion in Cell Biology 7. 406- 
413) . 

The mlo-based powdery mildew resistance is caused by the 
inactivation of the Mlo wild type allele, resulting in a 
recessive resistance phenotype . Substances that inhibit the 
activity of the Mlo wild type protein may be used to induce the 
resistance phenotype . 

An important hint that complete inactivation of Mlo 
expression is not essential and may even be detrimental is 
provided by the description of mutagen- induced mlo resistance 
alleles that are likely to have retained residual wild type 
allele activity. These alleles exhibit no detectable 
spontaneous leaf necrosis which negatively affects 
photosynthesis rates and yield (Hentrich, W (1979) Arch. 
zachtungsvorsch. , Berlin 9. S. 283-291). 

The Mlo protein is predicted to be membrane -anchored by 
seven transmembrane helices (see e.g. Figure 7) . This 
s-ructure prediction has been reinforced by recent analysis of 
Mlo homologues in rice and Arabidopais thaliana. Structure 
prediction of the AraJbidopsis thaliana homologue also suggests 
the presence of seven transmembrane helices. A comparison of 
the Mlo homologues revealed in addition conserved cysteine 
residues in the putative extracellular loops 1 and 3 and high 
probabilities of amphipathic helices in the second 
intracellular loop adjacent to the predicted transmembrane 
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helices 3 and 4» These . conserved structural motifs in the 
family of Mlo proteins are reminiscent of G protein coupled 
receptors (GPCR) described extensively in mammalian systems. 
GPCRs are known to be activated by ligands and to amplify 
5 signals intracellularly via heterotrimeric G proteins. Without 
in any way providing a limitation on the nature or scope of any 
aspect of the present invention, it is predicted that Mlo 
activates an inhibitory G alpha subunit of heterotrimeric G 
proteins, thus leading to a downregulation of as yet unknown 

10 effector proteins. 

The provision herein of Mlo sequence information enables 
the identification of antagonists of function of the Mlo 
protein (e.g. GPCR function). Antagonists of Mlo may block 
receptor activation by its unknown genuine ligand, mimicking 

15 recessive mutations in the Mlo gene. Such Mlo antagonists may 
be used as crop protection compounds, for example applied 
externally to the plant or crop or, where the compound is 
peptidyl in nature, delivered internally via a biological 
vector (e.g. recombinant infecting viral particle expressing 

20 the antagonistic molecule within target plant cells) or via a 

transgenic route (plants or plant cells genetically modified to 
express the antagonist molecule, perhaps under control of a 
promoter inducible by an externally applied compound (eg GST-II 
promoter from maize - Jepson et al Plant Molecular Biology 

25 26:1855-1866 (1994)) allowing control over the timing of 
expresion of the mlo inactivation phenotype . 

Leaf segments of Aflo wild type plants may be tested with a 
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test substance, e.g. from a random or combinatorial compound 
library, for resistance upon challenge with pathogen such as 
powdery mildew. The detached leaf segment assay is used as a 
standard test system to score for susceptibility/resistance 
upon inoculation with powdery mildew spores. Leaf segments of 
7 -day-old seedlings of the genotype Mlo Rorl may be placed on 
agar, for example individual wells of 96 -well microtiter plates 
containing BO/xl agar. Different compoxinds may be applied to 
the agar surface in each well at a concentration of about Ippm 
dissolved in DMSO. Around seven days after inoculation of the 
detached leaf segments with pathogen, such as spores of a 
virulent powdery mildew isolate, compounds which induce 
resistance may be recognised by the absence of fungal mycelium 
on leaf segments in the microtiter plates. 

A further selection may be used to discriminate between 
compounds that act in the mlo pathway and those that confer 
resistance by other mechanisms, or those which exhibit a direct 
fungitoxic activity. For this purpose mutants in genes [Ror 
genes) which may be required for mlo resistance (Preialdenhoven 
et al., (1996), The Plant Cell 8, 5-14) may be used. Mutants 
of these genes confer c,„»r>^»tibilitv to powdery mildew attack 
despite the presence of mlo resistance alleles. Plants of the 
genotype Mlo rorl (wild type Mlo protein and defective Rorl 
gene) may be used, for example, to test compounds which induce 
resistance on Mlo Rorl genotypes but exhibit susceptibility on 
the Mlo rorl genotype, enabling selection of candidate Mlo 
antagonists. Testing candidate compounds identified using a 
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leaf segment test may be used to drastically reduce the number 
of candidate compounds for further in vitro tests. 

A further selection step of candidate antagonists may 
involve heterologous expression of the Mlo protein or a 
5 fragment thereof (e.g. in a baculovirus insect cell system) and 
subsequent binding assays with labelled molecules . Specific 
binding of compounds to cell lines expressing wild type Mlo 
protein is a good indicator of their antagonistic mode of 
action. Analsis of the deduced Mlo protein sequence has 

10 provided strong evidence that the protein is anchored in the 
membrane via seven transmembrane helices and may represent a 
novel member of the so-called serpentine receptor family. The 
conclusion is supported by the sequence data derived from 
homologous genes identified in barley, rice and Arabidopsis. 

15 Seven transmembrane proteins have been shown to be expressed at 
high level in the Baculovirus/insect cell system (up to 10*^ 
molecules per cell - Tate and Grisshamer, 1996, TIBTECH 14: 
426-430) . Since the family of Mlo proteins appears to be 
restricted to the plant kingdom, this provides a low- background 

20 environment for compound tests. Candidate compounds which are 
labelled, radioactively or non-radioactively , may be tested for 
specific binding to Sf9 insect cells expressing the Mlo protein 
after infecion with a recombinant Ipaculovirus construct. 
Specificity of the binding may be tested further by Sf9 

25 expression of mutant mlo proteins which carry characterised 

mutations (e.g. as in Table 1) leading in vivo to resistance. 
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Thus, in various further aspects the present invention 
relates to assays for substances able to interfere with Mlo 
function, i.e. confer a mla mutant phenotype, such substances 
themselves and uses thereof. 

The use of Mlo in identifying and/or obtaining a substance 
which inhibits Mlo function is further provided by the present 
invention, as is the use of Mlo in identifying and/or obtaining 
a substance which induces pathogen resistance in a plant. 

Agents useful in accordance with the present invention may 
be identified by screening techniques which involve determining 
whether an agent under test inhibits or disrupts Mlo function 
to induce an inio phenotype. Candidate inhibitors are 
substances which bind Mlo. 

It should of course be noted that references to "Mlo" in 
relation to assays and screens should be taken to refer to 
homologues, such as in other species, including rice and wheat, 
not just in barley, also appropriate fragments, variants, 
alleles and derivatives thereof- Assessment of whether a test 
substance is able to bind the Mlo protein does not necessarily 
require the use of full-length Mlo protein. A suitable 
fragment may be used (or a suitable analogue or variant 
thereof) . 

Suitable fragments of Mlo include those which include 
residues known to be crucial for Mlo function as identified by 
mlo mutant alleles (Table 1). Smaller fragments, and analogues 
and variants of this fragment may similarly be employed, e.g. 
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as identified using techniques such as deletion analysis or 
alanine scanning. 

Furthermore, one class of agents that can be used to 
disrupt Mlo activity are peptides fragments of it. Such 
peptides tend to be short, and may be about 40 amino acids in 
length or less, preferably about 35 amino acids in length or 
less, more preferably about 30 amino acids in length, or less, 
more preferably about 25 amino acids or less, more preferably 
about 20 amino acids or less, more preferably about 15 amino 
acids or less, more preferably about 10 amino acids or less, or 
9, 8, 7, 6, 5 or less in length. The present invention also 
encompasses peptides which are sequence variants or derivatives 
of a wild type Mlo sequence, but which retain ability to 
interfere with Mlo function, e.g. to induce an mio mutant 
phenotype. Where one or more additional amino acids are 
included, such amino acids may be from Mlo or may be 
heterologous or foreign to Mlo. A peptide may also be included 
within a larger fusion protein, particularly where the peptide 
is fused to a non-Mlo(i.e. heterologous or foreign) sequence, 
such as a polypeptide or protein domain. 

Peptides may be generated wholly or partly by chemical 
synthesis. The compounds of the present invention can be 
readily prepared according to well-established, standard liquid 
or, preferably, solid-phase peptide synthesis methods, general 
descriptions of which are broadly available <see, for example, 
in J-M. Stewart and J.D. Young, Solid Phase Peptide Synthesis, 
2nd edition. Pierce Chemical Company, Rockford, Illinois 
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.1984), in M. Bodanzsky and A, Bodanzsky, The Practice of 
Peptide Synthesis, Springer Verlag, New York (1984); and 
Applied Biosystems 430A Users Manual, ABI Inc., Foster City, 
California) , or they may be prepared in solution, by the liquid 
phase method or by any combination of solid-phase, liquid phase 
and solution chemistry, e.g. by first completing the respective 
peptide portion and then, if desired and appropriate, after 
removal of any protecting groups being present, by introduction 
of the residue X by reaction of the respective carbonic or 
sulfonic acid or a reactive derivative thereof . 

Another convenient way of producing a peptidyl molecule 
according to the present invention (peptide or polypeptide) is 
zo express nucleic acid encoding it, by use of nucleic acid in 
an expression system, as discussed elsewhere herein. This 
allows for peptide agents to be delivered to plants 
-ransgenically, by means of encoding nucleic acid. If coupled 
::o an inducible promoter for expression under control of the 
user, this allows for flexibility in induction of an mla 
phenotype and pathogen resistance. This may allow for any 
side-effects arising from interference with Mlo function to be 
T^oderated. 

In one general aspect the present invention provides an 
assay method for a substance able to interact with the relevant 
region of Mlo, the method including: 

(a) bringing into contact a Mlo polypeptide or peptide 
fragment thereoof , or a variant, derivative or analogue 
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thereof, and a test compound; and 

(b) determining interaction or binding between said 
polypeptide or peptide and the test compound. 

A test compound found to interact with the relevant 
5 portion of Mlo may be tested for ability to modulate, e.g. 

disrupt or interfere with, Mlo function, as discussed already 
above . 

Another general aspect of the present invention provides 
10 an assay method for a substance able to induce an mla mutant 
phenotype in a plant, the method including: 

(a) bringing into contact a plant or part thereof (e.g. 
leaf or leaf segment) and a test compound; and 

(b) determining Mlo function and/or pathogen resistance 
15 and/or stimulation of a defence response in the plant. 

Susceptibility or resistance to a pathogen may be 
determined by assessing pathogen growth, e.g. for powdery 
mildew the presence or absence, or extent, of mycelial growth. 

Binding of a test compound to a polypeptide or peptide may 
20 be assessed in addition to ability of the test compound to 

stimulate a defence response in a plant. Such tests may be run 
in parallel or one test may be performed on a substance which 
tests positive in another test. 

25 Of course, the person skilled in the art will design any 

appropriate control experiments with which to compare results 
obtained in test assays. 
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Performance of an assay method according to the present 
invention may be followed by isolation and/or manufacture 
and/or use of a compound, substance or molecule which tests 
positive for ability to modulate Mlo function and/or induce 
pathogen resistance, such as resistance to powdery mildew. 

The precise format of an assay of the invention may be 
varied by those of skill in the art using routine skill and 
knowledge. For example, interaction between substances may be 
studied in vitro by labelling one with a detectable label and 
bringing it into contact with the other which has been 
immobilised on a solid support. Suitable detectable labels, 
especially for peptidyl substances include «s_methionine which 
may. be incorporated into recorabinantly produced peptides and 
polypeptides. Recombinant ly produced peptides and polypeptides 
may also be expressed as a fusion protein containing an epitope 
which can be labelled with an antibody. 

An assay according to the present invention may also take 
the form of an in vivo assay. The in vivo assay may be 
performed in a cell line such as a yeast strain or mammalian 
cell line in which the relevant polypeptides or peptides are 
expressed from one or more vectors introduced into the cell. 

For example, a polypeptide or peptide containing a 
fragment of Mlo or a peptidyl analogue or variant thereof as 
disclosed, may be fused to a DNA binding domain such as that of 
the yeast transcription factor C3AL 4 . The GAL 4 transcription 
factor includes two functional domains. These domains are the 
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DNA binding domain (GAL4DBD) and the GAL4 transcriptional 
activation domain {GAL4TAD) . By fusing such a polypeptide or 
peptide to one of those domains and another polypeptide or 
peptide to the respective counterpart, a functional GAL 4 
5 transcription factor is restored only when two polypeptides or 
peptides of interest interact. Thus, interaction of the 
polypeptides or peptides may be measured by the use of a 
reporter gene probably linked to a GAL 4 DNA binding site which 
is capable of activating transcription of said reporter gene. 

10 This assay format is described by Fields and Song, 1989, Nature 
340; 245-246. This type of assay format can be used in both 
mammalian cells and in yeast. Other combinations of DNA 
binding domain and transcriptional activation domain are 
available in the art and may be preferred, such as the LexA DNA 

15 binding domain and the VP60 transcriptional activation domain. 
When looking for peptides or other substances which 
interact with Mlo, the Mlo polypeptide or peptide may be 
employed as a fusion with (e.g.) the LexA DNA binding domain, 
with test polypeptide or peptide (e.g. a random or 

20 combinatorial peptide library) as a fusion with (e.g.) VP60. 
An increase in reporter gene expression (e.g. in the case of 
galactosidase a strengthening of the blue colour) results from 
the presence of a peptide which interacts with Mlo, which 
interaction is required for transcriptional activation of the 

25 /S-galactosidase gene. 



The amount of test substance or compound which may be 
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added to an assay of the invention will normally be determined 
by trial and error depending upon the type of compound used. 
Typically / from about 0,001 nM to ImM or more concentrations of 
putative inhibitor compound may be used, for example from 0.01 
nM to IOOmM, e.g. 0.1 to 50 ^M, such as about 10 fiM. Greater 
concentrations may be used when a peptide is the test 
substance. Even a molecule which has a weak effect may be a 
useful lead compound for further investigation and development. 

Compounds which may be used may be natural or synthetic 
chemical compounds used in drug screening programmes. Extracts 
of plants which contain several characterised or 
uncharacterised components may also be used. Antibodies 
directed to Mlo or a fragment thereof form a further class of 
putative inhibitor compounds. Candidate inhibitor antibodies 
may be characterised and their binding regions determined to 
provide single chain antibodies and fragments thereof which are 
responsible for disrupting the interaction. Other candidate 
inhibitor compounds may be based on modelling the 3 -dimensional 
structure of a polypeptide or peptide fragment and using 
rational drug design to provide potential inhibitor compounds 
with particular molecular shape, size and charge 
characteristics. It is worth noting, however, that 
combinatorial library technology provides an efficient way of 
testing a potentially vast number of different substances for 
ability to interact with and/or modulate the activity of a 
polypeptide. Such libraries and their use are known in the 
art, for all manner of natural products, small molecules and 
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peptides, among others. The use of peptide libraries may be 
preferred in certain circumstances. 

Following identification of a substance or agent which 
5 modulates or affects Mlo function, the substance or agent may 
be investigated further. Furthermore, it may be manufactured 
and/or used in preparation, i.e. manufacture or formulation, of 
a composition for inducing pathogen resistance in a plant. 
These may be applied to plants, e.g. for inducing pathogen 

10 resistance, such as resistance to powedery mildew. A further 
aspect of the present invention provides a method of inducing 
pathogen resistance in a plant, the method including applying 
such a substance to the plant. A peptidyl molecule may be 
applied to a plant transgenically , by expression from encoding 

15 nucleic acid, as noted. 

A polypeptide, peptide or other substance able to modulate 
or interfere with Mlo function, inducing pathogen resistance in 
a plant as disclosed herein, or a nucleic acid molecule 
20 encoding a peptidyl such molecule, may be provided in a kit, 

e.g. sealed in a suitable container which protects its contents 
from the external environment. Such a kit may include 
instructions for use. 

25 Further aspects and embodiments of the present invention 

will be apparent to those skilled in the art. The present 
invention will now be exemplified by way of illustration with 
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reference to the following figures : 

Figure 1 Positional Cloning of Mlo. The Mlo locus has 
been mapped with increasing precision on the long arm of barley 
chromosome 4 using morphological, RFDP and AFLP markers. The 
upper part of the figure presents the genetic linkage maps of 
these markers relative to Mlo. All genetic distances are 
indicated in centiMorgan (cM) based on mult i -point linkage 
analysis except for genetic distances between AFLP markers 
which are calculated by two-point-estimates. The morphological 
marker map (Jorgensen, 1977) positions Mlo at a distance of 
more than 2 0 cM to hairy leaf sheath (Hs) and glossy 
sheath/spike (gsl) . The RFLP marker map is based on the 
analysis of 257 individuals derived from the cross Carlsberg 
II Mlo Grannenlose Zweizeilige mlo-11. The previously 
published RFLP map (Hinze et al . . 1991) of the same cross was 
based on only 44 F2 individuals. The gene was delimited to a 
2.7 CM interval bordered by markers bAOll and bAL88. AFLP 
markers were identified and mapped as described in Experimental 
procedures. Their genetic distance to Mlo is based on the 
cross Ingrid Mlo x BC^Ingrid inlo-3. The crucial result of the 
AFLP analysis has been the identification of two markers, Bpm2 
and Bpm9, defining an 0.64 cM interval containing the Mio locus 
and one marker (Bpml6) cosegregating with Mlo on the basis of 
more than 4,000 meiotic events. Marker Bxm2 which is located 
0.1 cM telomeric to Aflo was derived from BAG F15 template DNA 
(see below). One YAC clone, YAC YHV303-A6, containing the 
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cosegregating marker Bptnie and two flanking loci {BpTn2 and 
Bpm9) , is shown in the middle section of the figure. The 
position of marker Bpm9 was only roughly estimated within the 
YAC clone as indicated by the arrow. The insert of BAG F15 
5 represents a 60 kb subfragment of this YAC as indicated in the 
lower part of the Figure. After the identification of AFLP 
marker Bpm2 in BAG F15, marker Bxm2 was discovered and 
positioned 0.1 cM in telomeric orientation of Mlo. The 
approximate physical position of AFLP markers Bpm2, Bpmi6, and 

10 Bxm2 (spanning an inteirval of approximately 3 0 kb) as well as 
the location of some rare occurring restriction sites are 
indicated. Dashed lines below the schematic representation of 
BAG F15 DNA show the position of the largest established DNA 
sequence contigs. The structure of the Mlo gene is given 

15 schematically in the bottom line of the Figure. Exons are 

highlighted by black boxes. Positions of mutational events are 
indicated for the eleven tested mlo alleles. Mutant alleles 
carrying deletions in their nucleotide sequence are marked with 
a a; the remaining mutant alleles represent single nucleotide 

20 substitutions resulting in amino acid exchanges in each case . 

Figure 2 shows an Mlo coding sequence and encoded amino 
acid sequence according to the present invention. The amino 
acid sequence predicted from DNA sequences of RT-PCR products 
from Ingrid Mlo are shown. Nucleotide numbers are given 

25 according to translational start site. 

Figure 3 Northern Blot Analysis of Mlo Transcript 
Accumulation. Total RNA (20 /ig) and poly (A) RNA (5 ^g) of 
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seven-day-old uninfected barley primary leaves of one wild type 
(cultivar Ingrid Mlo) and two mutant <BC Ingrid mlo-l, BC 
Ingrid jnlo-3) cultivars were isolated, separated on a 1.2% 
formaldehyde gel and transferred to a nitrocellulose membrane 
(Hybond) . The filter was probed under stringent conditions 
(Sambrook et al . , 1989) with the radioactivity labelled full 
size RT-PCR product derived from Ingrid ATlo (Figure 7) . A 
clear signal is detected only in the lanes containing poly (A) + 
RNA. The signal corresponds to a size of approximately 2 kb. 

Figure 4 Southern Blot Analysis of Intragenic Recombinants 
derived from mlo heteroallelic crosses. The alleles of two 
RFLiP markers flanking Mlo on opposite sides of either 
susceptible F2 individuals or homozygous susceptible and 
homozygous resistant progeny were determined by Southern blot 
analysis. Plant DNA (10 /ig) of the individuals were digested 
with P8t 1 (A) or Hae III (B) and hybridized with the 
radioactively labelled RFLP markers WG114 (upper panel; maps 
3.1 cM in centromeric orientation to Mlo; see Figure 1) and 
ABG366 (lower panel; maps 0.7 cM in telomeric orientation to 
Mlo: see Figure 1) according to standard procedures (Sambrook 
et al . , 1989) . 

A DNA of the parental lines mlo -8 and mlo-1 and two 
homozygous susceptible (S, Mlo Aflo) and two resistant (R, mlo 
mlo) progenies derived from two susceptible plants 
(designated 1 and 2) were tested. The DNAs in lanes S and R 
represent selection F3 individuals from F3 families obtained by 
selfing the susceptible F2 individuals 1 and 2. Note that 
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susceptible individuals are expected to be heterozygous at 
Mlo in this section scheme. Infection phenotypes were scored 
seven days after inoculation with the mlo avirulent isolate Kl . 
DNA from a third susceptible individual of this heteroallelic 
cross (see Table 7) is not included in this Figure - 

B DNA of the parental lines mlo-S and /nio-l and seven 
homozygous susceptible (S, Mlo Mlo) and seven resistant (R, mlo 
mlo) progeny derived from seven susceptible Fj plants 
(designated 1 to 7) were tested. The DNAs in lanes S and R 
represent selected F3 individuals from F3 families obtained by 
selfing the susceptible F2 individuals l to 7 . DNA was 
analyzed from two further susceptible individuals of this 
heteroallelic cross only in the Fj generation (8* and 9*) . 

Figure 5 shows an alignment of genomic sequences covering 
the barley Mlo gene and a rice homologue isolated via 
crosshybridization with a barley gene specific probe. The top 
line shows the barley Mlo genomic DNA sequence (exon sequences 
underlined) . The bottom line shows the rice genomic sequence 
containing the rice Mlo homologue. 

Figure 6 shows an alignment of genomic sequences carrying 
the barley Aflo gene and a barley homologue isolated via 
crosshybridization with a barley gene specific probe. The top 
line shows the barley Mlo genomic DNA sequence (exon sequences 
underlined) . The bottom line shows the genomic sequence 
containing the barley Mlo homolocfue . 

Figure 7 Nucleotide and Deduced Amino Acid Sequence of the 
Barley Mlo cDNA, The nucleotide and the deduced amino acid 
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sequence are based on the combined data of RT-PCR and RACE 
obtained from experiments using RNA of cultivar Ingrid Mlo. 
The stop codon is marked by an asterisk, the putative 
polyadenylation signal is underlined and the detected termini 
of RACE products are indicated by arrows above the sequence. 
Positions of introns as indentified by comparison with 
corresponding genomic clones are labelled by triangles below 
the nucleic acid sequence. Six predicted transmembrane 
spanning helices according to the MEMS AT algorithm {Jones et 
al,, 1994) are boxed in grey colour. A putative nuclear 
localization signal (K-K-K-V-R) and casein kinase II site (S-I- 
F-D) in the carboxy- terminal half of the protein are shown in 
bold type . 

Figure 8 shows genomic sequence of rice (Oryza sativa) 
homologue including coding and flanking secjuences . 

Figure 9 shows genomic sequence of barley (Hordeum 
vulgare) homologue including coding and flanking sequences. 

Figure 10 shows cDNA sequence of rice homologue. 

Figure 11 shows cDNA sec[uence of barley homologue. 

Figure 12 shows cDNA sequence of Arabidopsis thaliana 
homologue . 

Figure 13 shows amino acid sequence of rice homologue . 
Figure 14 shows amino acid sequence of barley homologue. 
Figure 15 shows amino acid sequence of Arabidopsis 
homologue . 

Figure 16 shows a pretty box of amino acid sequences of 
Mlo, barley, rice and Arabidopsis homologues. 
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All documents mentioned in this document are incorporated 
by reference. 

EXAMPLE 1 - CLONING OF MLO OF BARLEY 

Targeted search for AFLP markers tightly linked to Mlo 

Efforts to increase the DNA marker density around Mlo were 
coordinated with attempts to construct a local high resolution 
genetic map. An alternative possibility would have been to 
extend the population size of the characterized cross Carlsberg 
II Mlo X Grannenlose Zweizeilige mlo-ll {Hinze et al • , 1991) 
but it was felt to be advantageous to establish a high 
resolution map starting out from one of the available 
BC mio lines and its recurrent parent line. Importantly, the 
donor parent of the BC line represents a different genetic 
background in comparison to the recurrent parent line. In this 
way, searching for linked AFLP markers could be started in 
parallel with generating a large mapping population from a 
cross between the same genetic lines. In addition, the BC line 
based cross allowed testing of colinearity of DNA markers in 
the vicinity of Mlo as determined from the cross 
Carlsberg II Mlo x Grannenlose Zweizeilige mlo-ll (Hinze et 
al . , 1991). For the new cross a inlo-3 backcross (BC) line was 
used that had been backcrossed seven times into the genetic 
background Ingrid (BC7 Ingrid inlo-3 ; Hinze et al . , 1991). The 
line was previously characterized to carry a relatively small 
introgressed DNA segment on barley chromosome 4 . In addition, 
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the donor parent line Malteria Heda mla-2 exhibits in 
comparison to DNA from the recurrent parent Ingrid 
polymorphisms with most of the identified RFLP loci linked to 
Mlo. Thus, by searching polymorphisms only between two DNA 
templates, from lines Ingrid Mlo and BC^ Ingrid jnlo-3, we hoped 
to increase the density of DNA markers with AFLPs around Mlo in 
a targeted manner. 

The same two lines were crossed to establish a segregating 
population for high resolution mapping of DNA markers, formally 
representing an eigth backcross. F2 individuals were scored for 
mlo resistance after powdery mildew inoculation with isolate Kl 
(virulent on Ingrid Mlo and avirulent on BC^ Ingrid ) • 

Initially, only a small fraction of the Fj (77 individuals) was 
analyzed for recombination events with flanking RFLP markers • 
Analysis of four identified recombinants (designated 8-32-2, 7- 
38-4, 1-34-1, and 1-49-4) indicated colinearity of marker order 
in this cross compared to the previously analyzed cross 
Carlsberg II Mlo x Grannenlose Zweizeilige /nio-ll (Hinze et 
al,, 1991) . Several of the 77 Fj seedlings which exhibited a 
susceptible phenotype and heterozygosity for the tested 
flanking DNA marker loci (bAOll, bAL88/2, and bAP91; Hinze et 
al., 1991) were grown to maturity to provide further selfed 
seed material segregating for Mlo/mlo-3 in the F3 generation. 
In total, leaf material was harvested for high resolution 
marker mapping from 2,026 individuals derived from either the 
selfed F2 or F3 generation. 

AFLP marker candidates were identified by testing all 
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possible Pst 1/Mse I primer combinations (1,024) extending into 
genomic sequences up to nucleotide positions +2 and +3, 
respectively. Similarly, almost 1,900 Eco Rl/Mse I primer 
combinations (+3/+3) have been analyzed. Four DNA templates 
5 were included in this analysis: Ingrid Mia, BC^ Ingrid mla'3 , a 
DNA pool of two phenotypically mlo resistant individuals, 
and a DNA pool of nine phenotypically susceptible F2 
individuals. The resistant and susceptible F2 individuals which 
were included as DNA pools in the AFLP search had been selected 

10 from the above mentioned RFLP analysis of 77 Fj segregants. The 
pooled F2 DNA enabled us to control whether candidate 
polymorphisms detected between template DNA from the parents 
were heritable traits in the F2 . All identified AFLP candidate 
markers have been re-examined with eight DNA templates: Ingrid 

15 Mlo, BCj Ingrid mla-3 , DNA pools from individuals of three F3 
families which were phenotypically homozygous susceptible 
[MloMlo) according to Kl inoculation experiments; DNA of three 
resistant Fj individuals, A total of 18 Pst I/Mse I and 20 
Eco Kl/Mse I primers were confirmed based on the selection 

20 procedure. 

The number of identified AFLP markers made it useful to 
assign them first roughly to marker intervals based on the RFLP 
map around Wlo. It was hoped that this approach should enable 
both evaluation of the distribution of AFLPs among previously 

25 identified RFLP intervals close to Mlo and selection of a pair 
of flanking AFLP markers with which recombinants could be 
identified among the 2,026 segregants. For AFLP assignment we 



wo 98/04586 PCT/GB97/02046 

69 

used those four recombinants that had been identified with RFLP 
markers out of the above mentioned small sample of 77 
segregants from Ingrid Mlo x BC7 Ingrid mlo-J (two recombinants 
in interval bAP91-bALi88 , one in ATlo-bAOll, and one in bAOll- 
5 ABG366) . A total of 18 AFLPs were found to be located within a 
genetic distance of approximately 3.5 cM including Mlo. 

Cons tzruat ion of a high resolution AFJbP map around Mlo 

A two-step procedure was used to construct the high 

10 resolution AFLP map. First, all 2,026 segregants were screened 
for recombination events between two AFLP markers on opposite 
sides of Aflo and subsequently only the few identified 
recombinants were used to map all the identified AFLPs in the 
3.5 CM target interval . AFLP markers Bpml and Bpm9 were 

15 chosen, detecting each allelic DNA fragments in Ingrid Mlo and 
BC7 Ingrid mlo-3 and located on opposite sites of Mlo to screen 
DNA templates of the segregants for recombination events. 
Alternatively, the search for recombinants could have been 
carried out with the flanking RFLP markers bAOll and bAL88. 

20 However, although the conversion into cleaved amplified 

polymorphic sites (CAPS) was successful for both markers, 
difficulties to display the alleles of both loci simultaneously 
from crudely purified genomic DNA were encountered. A total of 
2,026 individuals (F2 or F3 segregants) were screened 

25 simultaneously with AFLP markers Bpml and Bpm9 and 98 

recombinants were identified. AFLP analysis was subsequently 
carried out with each of the 98 DNA templates of the 
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recombinants to identify the alleles of each of the identified 
of AFIjP loci. The recombinants have been selfed and 
inoculation experiments with powdery mildew isolate Kl were 
performed using at least 2 5 individuals of each recombinant 
family to deduce the alleles of the previous generation at the 
Mlo locus. The obtained datia enabled the construction of a high 
resolution map around Mlo based on more than 4,000 meiotic 
events and a resolution of at least 0.025 cM derived via two- 
point estimates. The essential result has been the 
identification of a DNA marker cosegregating with Mlo (Bpml6) 
and two flanking markers (Bpm2 and Bpm9) at a distance of 0.25 
and 0.4 cM respectively (Figure 1). 

Construction of a large insert size harley YAC lliDrary, 
isolation of Bpml6 containing YACs, and physical delimitation 
of Mlo 

The genetic evidence indicates that jnlo resistance is due 
to loss of function in the MJo wild type allele. Therefore, it 
was decided to establish a large insert size YAC library from 
cultivar Ingrid Mlo into vector pYAC4 (Burke et al . , 1987; 
Hieter, 1990) . Megabase DNA suitable for YAC cloning 
experiments was prepared in mg amounts from mesophyll 
protoplasts of five-day-old seedlings according to a modified 
protocol described by Siedler and Graner (1991) . The DNA was 
partially digested with Eco RI in the presence of Eco RI 
methyltransf erase to obtain DNA fragments after preparative 
pulsed-field gel electrophoresis (PFGE) in the size range of 
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500-600 kb. After ligation with Eco RI digested pYAC4 , the DNA 
was transformed into yeast strain AB13 8 0 and colonies carrying 
recombinant pYAC4 DNA were selected on solidified synthetic 
complete medium lacking tryptophan and uracil (Sherman et al . , 
1986) • Forty randomly selected yeast colonies were tested for 
the presence of barley DNA using labelled barley genomic DNA in 
Southern experiments. The size of the YAC inserts was found 
after PFGE separations to vary between 500 and 800 kb- On 
average a genetic distance of 0.2 cM was expected to be 
represented on the individual recombinant YAC clone. A total of 
-40,000 clones representing four barley genome equivalents have 
been generated. 

Four YAC clones (designated 303A6, 322G2, 400H11, and 
417D1) have been isolated with marker Bpml6 cosegregating with 
Mlo. Their insert size was determined by PFGE to be 650, 710, 
650, and 820 kb respectively. AFLP analysis had shown that 
three of these clones (303A6, 322G2, and 417D1) contain also 
both flanking marker loci whereas clone 4 0 0H11 contains only 
loci Bpml6 and Bpm2 . These findings strongly suggested that the 
Mia gene had been physically delimited on recombinant YAC 
clones 303A6, 322G2, and 417D1. 

YAC 3 03A6 was chosen for subcloning experiments into. BAC 
vector pECSBAC4 containing a unique Eco RI site (Shizuya et 
al., 1992; the vector pECSBAC4 is described by Frijters and 
Michelmore, 1996; submitted). Total yeast DNA of this clone was 
partially digested with Eco RI to obtain DNA fragments with an 
average size of 50 kb and ligated into Eao RI digested and 
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dephosphorylated BAG vector- Bacterial colonies containing 
YAC 303A6-derived DNA in pECSBAC4 were identified by replica 
colony hybridization experiments. One set of colony containing 
membranes was hybridized with labelled yeast AB1380 DNA and the 
replica set was hybridized with labelled PFGE-purif ied YAC303A6 
DNA. Recombinant BAG clones containing the AFLP locus Bpml6 
were subsequently identified using the cloned 108 bp 
Pst 1/Mse I genomic Bpml6 fragment as a probe in colony 
hybridization experiments. 

One BAG clone, BAG F15, containing an insert of - 60 )cb 
was chosen for further detailed studies. It was found that the 
recombinant BAG clone contained in addition the AFIiP marker 
locus Bpm2, but not Bpmd . At this point the BAG F15 insert DNA 
indicated successful physical delimitation in telomeric 
orientation but it was an open question whether the insert 
would contain bordering sequences in centromeric direction. 
Instead of constructing a BAG contig between Bpm 16 and Bpm9, 
the option to develop new polymorphic markers from BAG F15 was 
chosen. An allelic Xba I/Msg I polymorphism {designated Bxm2) 
was identified between the parental lines Ingrid Mia and 
BG7 Ingrid mla-3 . 

An analysis of the 25 recombinant individuals carrying 
recombination events within the Mia containing interval Bpm2- 
Bpm9 enabled mapping of Bxm2 in centrometric orientation at a 
distance of 0.1 cM from M2o- Only four out of the 16 available 
recombinants in the interval Bpm9-Afio and none of the 9 
recombinants in the interval Mlo-Bpm2 were found to exhibit a 



wo 98/04586 



PCT/GB97/02046 



73* 

recombination event between Bxm2 and Aflo. It was concluded 
that Mlo had been physically delimited on BAG F15 between 
marker loci Bpm2 and Bxm2 (Figure 1) , 

Identification of the Mlo gene and mlo mutants 

A random sequencing project was initiated to determine 
sequence contigs of the -60 kb insert of BAG F15 before marker 
Bxm2 was identified and shown to delimit the gene in telomeric 
orientation. In parallel, a physical map was generated 
(Figure 1) . The physical map indicated that the flanking 
markers Bpm2 and Bxm2 are physically separated by --30 kb. The 
sequence contigs were searched for regions of high coding 
probability using the UNIX versions of the STADEN program 
package. Only one sequence contig of almost 6 kb, including the 
cosegregating marker Bpml6, revealed an extensive region of 
high coding probability. 

RT-PGR reactions were performed with total leaf RNA 
derived from cultivar Ingrid Mlo using a series of primers 
deduced from regions which indicated high coding probabilities 
and obtained in each case a distinct amplification product. 
Sequencing of the largest RT-PGR products revealed a single 
extensive open reading frame of 1,602 bp (Figure 2). The 
deduced putative protein of 533 amino acids has a molecular 
weight of 60.4 kDal . The -1.7 kb RT-PCR product was used as a 
hybridization probe and detected a single RNA transcript of 
-1,9 kb length. (Figure 3) . A comparison of the genomic 
sequence and the largest RT-PGR fragment reveals 12 exons and 
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11 introns, each flanked by the characteristic splice site 
sequences (Figure 1) . 

Because marker Bpml6 is located at the 3' end of the above 
described gene (exon 11) and cosegregates with the Mlo locus, 
we started a direct PGR sequencing of the various available 
mutagen- induced mlo resistance alleles. We identified in 14 
out of 15 tested mutant alleles nucleotide alterations which 
result either in single amino acid alterations, deletions or 
frame shifts of the wild type sequence (Table 1) . We suspect 
that mutant allele jnlo-2 is located within the promoter- or 5' 
untranslated sequences. The region is notoriously difficult to 
be sequenced via direct PGR sequencing from genomic DNA 
templates but experiments using a series of nested primers are 
likely to solve this problem. In summary, the comparative 
sequencing of genomic DNA from various mutant mlo lines and 
their respective Mlo wild type ciltivars provided strong 
evidence that Mlo has been identified, 

Jntragenic recomJDlnants 

It had been the intention to provide a chain of evidence 
for the molecular isolation of Mlo which did not rely upon 
complementation experiments via transgenic barley plants. We 
had chosen to develop an unusual genetic tool to confirm that 
the identified gene represented Mlo. It was reasoned that if 
the mutations observed in the above described gene caused 
resistance to the powdery mildew fungus, recombination events 
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between mutant allele sites should restore wild type sequences. 

It was predicted that those intragenic recombinants would 

exhibit susceptibility upon powdery mildew attack. 

A crossing scheme was devised involving mlo resistance 

alleles mlo-1, mlo-5, and mlo-fl. The mutant alleles originate 

from the genetic backgrounds Haisa (ialo-1) and Carlsberg II 

{mlo-5 and mlo-8) . Interrautant crosses were performed as shown 

in Table 2 generating in each case at least 10 plants. F2 

populations were obtained by self-fertilization. seedlings 

were screened for rare susceptible individuals after 

inoculation with powdery mildew isolate Kl which is virulent on 

each of the parental Mlo wild type cultivars. Susceptible Fa 

-4 

individuals were identified with a frequency of ~6 x 10 . In 
contrast, if comparable numbers of progenies from selfings of 
each of the mlo mutants were tested for resistance to Kl, no 
susceptible seedling was identified. This finding strongly 
indicated that the majority of the susceptible individuals 
derived from the intermutant crosses were not due to 
spontaneous reversion events of the mutant mlo alleles. 

Inheritance of the susceptible Fj individuals was tested 
after selfing in F3 families. Each of the F2 individuals 
segregated susceptible and resistant F3 individuals indicating 
hetrozygosity for alleles conferring resistance/susceptibility 
in the Fa- Homozygous susceptible F3 progeny were isolated for 
the majority of susceptible Fj individuals by selfing of F3 
individuals and subsequent identification of F4 families in 
which only susceptible individuals were detected. 
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A molecular analysis of the susceptible individuals has 
been performed using RFLP markers known to be tightly linked 
(< 3 cM) on each side of the Mia locus (Figure 4) . RFLP marker 
WG114 maps in centromeric orientation relative to Mlo, marker 
ABG366 maps in the direction of the telomere. Detected RFLP 
alleles are shown for the intermutant crosses mlo-S x mlo-1 (A) 
and mlo-i x jnlo-5 (B) . DNA was analyzed either from susceptible 
F2 individuals (indicated by ♦) or from homozygous susceptible 
(S) and homozygous resistant (R) F3 progeny obtained from 
selfed susceptible F2 individuals. 

The homozygous susceptible F3 progeny from the susceptible 
F2 plant #1 of cross 

mlo-8 X mlo-l (Figure 4) reveals the WG114 allele derived from 
the inlo-l parent in centromeric orientation next to Mia and the 
ABG366 allele from the mla-8 parent in telomeric orientation to 
Mlo. The homozygous resistant F3 progeny from F2 plant #1 of 
this cross reveals in contrast only the flanking marker alleles 
derived from parent mla-1. The finding strongly suggested that 
susceptibility in Fj plant #1 is caused by a cross-over type of 
recombination in the preceding meiosis of one chromosome which 
results in a restoration of the Mia wild type allele whereas 
the second Fj chromosome of individual 1 contains a 
functionally unaltered mla-1 allele. The allelotypes. of the 
RFLP loci of the homozygous susceptible F3 progeny from 
susceptible F2 plant #2 are identical to the one described 
above. However, flanking marker alleles from the homozygous* 
resistant F3 progeny of this individual are in both cases 
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derived from the mJo- 5 parent. It is concluded that again a 
cross-over type of recombination restored one Mlo wild type 
allele in the susceptible individual #2. 

Nine susceptible F2 individuals were recovered from the 
cross ntlO'l x jnla-5 (Figure 4) . For susceptible Fj individuals 
#1 to #7 both homozygous susceptible and homozygous resistent 
F3 progeny were analyzed at the DNA level. Note that only DNA 
from the heterozygous susceptible F2 individuals was analyzed 
in the case of individuals #8 and #9 (marked by a *) • The 
following allele patterns with respect to the flanking RFLP 
loci were observed: (i) homozygous resistant F3 progeny showed 
on both sides of Mlo either only the allelotypes of loci WG114 
and ABG3 66 derived from the mlo-1 parent (individuals #1, #3, 
#6, #7) or only the allelotypes derived from the mlo-5 parent 
(individuals »2 , #4, #5). (ii) Homozygous susceptible F3 
progeny showed in contrast either only the allelotypes of both 
loci derived from the mlo-.S parent (no. #3/ #5, #6) or they 
showed different allelotypes on both sides of Mlo (individuals 
#1, #2, #4, #7) . {1x1) The homozygous susceptible F3 progeny 
with different allelotypes on both sides always contain in 
centromeric orientation the mlo-l derived WG114 allele and in 
telomeric orientation the mlo-^S derived ABG366 allele, (iv) The 
heterozygous susceptible Fj individual #8 reveals on either 
side next to Mlo only the alleles derived from parent /nlo-5. 
The heterozygous susceptible individual #9 reveals in 
centromeric orientation alleles derived from both parents mlo-I 
and wlo-5 whereas only the mlo-5 derived allele is detected in 
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telomeric orientation. A comprehensive interpretation of the 
data suggests that susceptibility in individuals no. #1, #2, 
#4, #7, and #9 is caused by a cross-over type of recombination 
restoring the Mlo wild type allele. Non cross-over types of 
5 recombination may have restored the Mlo wild type allele in 
individuals no. #3, #5, #6, and #8. 

A compilation of the detected flanking RFLP alleles of all 
isolated susceptible individuals or homozygous F3 progeny is 
shown in Table 3. Note that individual #3 of the cross mlo-8 x 
10 mlo-l is not shown in Figure 4. The compilation reveals that 

(i) cross-over types of recombination (CO) and non cross-over 
types of recombination (NCO) are found with a ratio of 7 : 5, 

(ii) cross -over types of recombination are resolved 
unidirectional, and (iii) NCO recombinants were not observed 

15 with parental mJo-l-linked RFLP alleles. 

The CO type intragenic recombinants isolated from 
heteroallelic mlo crosses were used to test whether wild type 
sequences of the Mlo candidate gene had been restored. For the 
three relevant alleles mlo-l, mlo-^S, mlo-S alleles candidate 

2 0 mutation sites have been identified (Table 1 and 4) . Direct 
PCR sequencing of genomic DNA of susceptible intragenic 
recombinants derived from both heteroallelic crosses mlo-l x 
jnlO'8 and mlo-1 x mlo-S revealed restoration of wild type 
sequences (Table 4) . This observation strongly suggests that 

25 the intragenic cross over event occurred between nucleotide -1 
and +483. in the former and +3 and +483 in the latter cross 
(according to translat ional start site) , Thus, the molecular 
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analysis of seven intragenic recombinants from two 
heteroallelic crosses provides final proof that the above 
described candidate gene represents Mlo. 

EXAMPLE 2 - HOMOLOGUES OF THE IDENTIFIED MLO GENE 

The available expressed sequence tag (EST) databases of 
Oryzae sativa (rice) and Arabldopsis thaliana were searched for 
homologous protein sequences. Five Arabidopsis cDNA clones were 
identified whose deduced amino acid sequences show substantial 
similarity to the Mlo protein. Remarkable is cDNA clone 
205N12T7 which reveals a chance probability of 1.2 e"*^. In 
addition, at least one significant homologue was found in rice 
(OSR16381A) . 

A rice BAC library (Wang et ai . , 1995) has also been 
screened with a labelled barley genomic fragment containing 
Mlo. A BAC clone containing an insert of -23 kb was isolated. 
Subsequent subcloning enabled isolation of a 2 . 5 kb Pst I 
genomic rice fragment showing strong cross -hybridization with 
the barley Mlo gene probe. DNA sequencing of this fragment 
revealed remarkable DNA sequence similarities within expn 
sequences of the barley Mlo gene (Figure 5) . 

Finally, a 13 kb X genomic barley clone derived from 
cultivar Igri (Stratagene) was isolated with a labelled barley 
genomic fragment containing Mlo, The nucleotide sequence 
derived from a subcloned 2.6 kb Sac I fragment reveals again 
extensive sequence similarities to the Mlo gene (Fig. 6). The 
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location of the barley Mlo homologue within the genome is not 
within BAG F15 DNA. 

In summary, there is conclusive evidence for Mlo 
homologues both in a monocotyledonous and a dicotyledonous 
plant species . 

Discussion 

TUiy speculation as to mode of action of Mlo and mlo 
nucleic acid and polypeptides should provide no limitation on 
the nature or scope of any aspect or embodiment of the present 
invention. 

In plants, resistance to pathogens is frequently 
determined by dominant resistance genes, whose products are 
assumed to recognize pathogen-: derived avirulence gene products. 
This mode of pathogen defence follows Flor's gene-f or-gene 
hypothesis (Flor, 1971) . Recently, several 'gene-f or-gene' type 
resistance genes have been molecularly isolated {Martin et al . , 
1993; Bent et al . , 1994; Jones et al , , 1994; Mindrinos et al . , 
1994; Whitham et al., 1994; Grant et a J . , 1995; Lawrence et 
al., 1995; Song et al . , 1995). The surprising finding is that 
the deduced proteins share remarkable similar structural 
domains although they trigger resistance reactions to pathogens 
such as viruses, fungi, and bacteria (Dangl, 1995; Staskawicz 
et al . , 1995). The isolated genes code for proteins that either 
contain a leucine-rich region (LRR) , with or without an 
attached nucleotide binding site (NBS) , indicative of ligand- 
bindihg and protein-protein interaction or encode a simple 
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serine/threonine kinase. A structural combination of LRR and 
the kinase domain has been reported in the deduced protein from 
the rice Xst21 resistance gene (Song et al . , 1995). The 
structural similarity of resistance genes in ' gene-f or-gene ' 
defence makes the existence of a common underlying resistance 
mechanisms likely. 

Resistance mediated by recessive resistance alleles of the 
Mlo gene differs in various aspects from * gene -for -gene' 
resistance (see introductory comments above) . The molecular 
isolation of the Mlo gene and the sequencing of various 
mutation- induced mla alleles described here, confirms previous 
interpretations from combined mutational and Mendelian genetic 
studies (Hentrich, 1979; Jargensen, 1983) . It is concluded that 
defective alleles of the Mlo locus mediate broad spectrum 
resistance to pathogens such as the powdery mildew pathogen. 
This is inconsistent with the involvement of a specific 
recognition event of a pathogen-derived product as has been 
proposed for race-specific resistance genes. 

Pleiotropic effects of mlo alleles have provided some 
clues towards the development of a molecular concept of the 
observed broad spectrum resistance response. 

Firstly, aseptically grown mlo plants exhibit at a high 
frequency a spontaneous formation of cell wall appositions 
(CWAs) in leaf epidermal cells (Wolter et al . , 1993). Those 
CWAs are usually formed in response to attempted pathogen 
penetration directly beneath the fungal apressorium. CWAs are 
believed to form a physical barrier against pathogen ingress 
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and have been implicated repeatedly in mlo mediated resistance 
(Bayles, 1990) . 

Secondly, at a later stage, the plants develop 
macroscopically detectable leaf necrotic flecks. The 
5 spontaneous leaf necrosis response has been extensively studied 
with a unique collection of 95 chemically- induced mlo alleles 
(Hentrich, 1979) . The alleles were classified as either showing 
a gradually different infection phenotype upon infection of a 
mixture of nine powdery mildew isolates. Those mlo alleles 

10 which give rise to an intermediate infection phenotype (i.e. 
development of a considerable number of sporulating fungal 
colonies upon inoculation) showed no detectable spontaneous 
leaf necrosis whereas the category of the most effective 
resistance alleles exhibits pronounced necrosis in the absence 

15 of the pathogen. Thus, there is solid evidence that the former 
category of mlo alleles retain residual wild type allele 
activity and those alleles appear to exhibit no detectable 
spontaneous leaf necrosis. 

Thirdly, a constitutive expression of defence-related 

20 genes has been observed in mlo seedlings grown under mildew- 
free conditions - in primary leaves when 10-11 days old; this 
includes genes of the PR-l family, chitinases and peroxidases. 

We have shown that mlo in barley confers increased 
resistance to different types of yellow rust (Pucclnlst 

25 struclformis) when a one to one mixture of talcum powder and 
spores were aviblown onto leaves of mlo barley plants after 
onset of constitutive expression of defence related genes (10- 
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11 day old mlo seedlings) . 

Thus, it appears that multiple defence-associated 
responses are constitutively expressed in mlo plants. 

The temporal relationship of these events is interesting: 
5 the onset of constitutive defence-related transcript 

accumulation is detected in 11 day-old seedlings and precedes 
CWA formation which is followed by the appearance of 
macroscopically visible leaf necrosis. Importantly, however, 
jnlo resistance can be experimentally tested as early as in five 
10 day-old seedlings and is fully functional at this time. We 
conclude that the Mlo protein has a negative regulatory 
function in plant defence and that plants with a defective 
protein are *primed' for the onset of defence responses. 
The deduced amino acid sequence of Mlo reveals no 
15 significant homologies to any of the described plant resistance 
genes so far, supporting the idea of a distinct molecular 
resistance mechanism. The Mlo gene shows also no striking 
similarities to any characterized plant or mammalian gene 
sequence in the various data bases. However, highly significant 
20 homologous sequences have been identified in the EST and 
genomic databases both from rice and Arabldopsls thctliana 
(Table 5 and Figure 5) . This strongly suggests that the Mlo 
protein represents a member of a novel protein family. A 
putative nuclear localization motif (NLS) is found within exon 
25 12 providing indication of nuclear localization of the protein 
(KEKKKVR; Nigg et al . , 1991). The significance of this motif is 
supported by a casein kinase II motif located 14 amino acids 



wo 98/04586 



PCT/GB97A>2a46 



84 

into direction of the NHj- terminus (SIFD; Rihs et al , , 1991). 
Functional tests may examine the putative subcellular 
localization of the Mlo protein. 

Mutations have been described also in other plant species 
in which defence responses to pathogens appear to be 
constitutively expressed (Walbot et al . , 1983; Pryor, 1987; 
Jones, 1994) . It has been suggested that this class of mutants, 
termed lesion mimics (Les) or necrotic mutants (nec) , affect 
the control of plant defence responses. Recessively inherited 
lesion mimic mutants have been systematically analysed in 
ArsLbidopsls thallana (Greenberg and Ausubel, 1993; Dietrich et 
al . , 1994; Greenberg et al . , 1994; Weymann et al . 1995). The 
affected genes have been designated acd (accelerated cell 
death; acdl and acdJ?) or isd (lesions simulating disease 
resistance response; Isdl to Isd7) . 

Each of the mutants exhibits, in the absence of pathogens, 
HR characteristics such as plant cell wall modifications and 
the accumulation of defence-related gene transcripts. Leaves of 
the aed2 mutant have been shown to accumulate high levels of 
salicylic acid and of the Arabidopsis phytoalexin, camelexin 
(Tsuji et al., 1992). Importantly, acd and Isd mutants exhibit 
elevated resistance to a bacterial (P. syringaB) and fungal (P. 
parasitica) pathogen. The Isdl mutant is exceptional in that it 
confers heightened pathogen resistance at a prelesion state, in 
contrast to the other defective loci which exhibit elevated 
pathogen resistance only in the lesion-positive state. In this 
respect, Isdl resembles the mlo mutants in barley. Another 
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Striking feature of Isdl is the indeterminate spread of lesions 
in contrast to the other mutants where lesion growth is 
determinate . 

5 EXPERIMENTAL PROCEDURES 

Plant Material 

A compilation of the mlo mutants and their mother 

varieties analyzed in this study has been described by 
10 Jorgensen (1992) [mla-1., mla-^ , mlo-4, mla-S, mla-1 , mlo-B, 

mlo- 9, mlo-XO, mlo- 11] and by Habekuss and Hentrich (1988) 

[mutants in cultivar Plena 2018 (inIo-13), 2034 (njio-17) , 2118]. 

Since mutant 2118 has not been assigned to an allele number so 

far, we designate the allele here as inla''2S , according to 
15 current numbering in the GrainGene database 

(gopher : //greengenes . cit • comell . edu : 70/77/ . graingenes . ndx/ 

index?mlo) . 

The high resolution map is based on a cross between Ingrid 
Mlo X BC7 Ingrid nilo-3. plants were selfed generating a 

20 segregating F2 population of approximately 600 plants. 
Pheno typically susceptible plants which showed 
heterozygosity for RFLP markers on opposite sites of Mlo were 
selfed and generated further segregants in the F3 generation 
for high resolution mapping. 

25 

Powdery Mildew Infection Testa 

The fungal isolate Kl (Hinze et al . , 1991) is virulent on 
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all cultivars used in this study carrying the Mlo allele and 
avirulent on all tested mlo genotypes. Plant growth and 
inoculation with Eryslphe graminls f sp hordel were carried out 
as described previously (Freialdenhoven et al . , 1996). The 
genotype at Mlo of recombinants used for the high resolution 
map were determined after selfing and subsequent inoculation 
experiments in F3 or F4 families comprising at least 24 
individuals . 

AFLP Ansilysis 

Genomic DNA for AFLP analysis was isolated according to 
Stewart and Via (1993) . AFLP analysis was carried out with 
minor modifications as described by Vos et al. (1995) , For 
screening of AFLP markers linked to Mlo we used the enzyme 
combinations Pst I/Mse I with amplification primers carrying +2 
and +3 selective bases . respectively in genomic sequences of 
amplified fragments. For Eco RI/Mse I amplification primers we 
used +3 and +3 selective bases respectively. A set of four DNA 
templates has been used: from the susceptible parent cultivar 
Ingrid Mlo, the resistant parent BC7lngrid inlo-3, a pool of two 
resistant F2 individuals (inIo-3 jnIo-3) and a pool of nine 
susceptible F2 individuals [Mlo Mlo) derived from the cross 
Ingrid Mlo x BC7 Ingrid inio-3. Amplified genomic fragments 
representing AFLP markers Bpm2, Bpm9, and BpmlG (Figure 1) were 
cloned and sequenced as follows: gel pieces (fixed by vacuum 
drying to Whatman 3MM paper) containing the amplified genomic 
fragments were identified via autoradiography and subsequently 
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excised. 100 fil water were added, boiled for 10 min. and after 
centrif ugation 5 ;xl of the supernatant were used as a template 
for non- radioactive reamplif ication (30 cycles) with the 
selective AFLP primers. Amplification products were isolated 
after agarose gel using a DNA isolation kit (Jetsorb, Genomed 
Inc., USA) . DNA was reated with Klenow polymerase and T4 
polynucleotide kinase and subsequently cloned in the EcoRV site 
of pBluescript SK (Stratagene) . Sequencing reactions were 
performed using a dye terminator cycle sequencing reaction kit 

(Perkin Elmer) and resolved either on an ABI 373 or 377 

(Applied Biosystems) automated sequencer. 

Barley YAC Library and BAC Suhlihrary Construction of YAC 
YHV303'A6 

The YAC library of barley cultivar Ingrid was established 
using the pYAC4 vector (Burke et al . , 1987; Kuhn and Ludwig 
1994) and yeast strain AB 1380. Details of the library 
construction and its characterization will be described 
elsewhere. Screening for YAC clones containing marker Bpml6 
was done by AFLP analysis. For construction of a BAC 
sublibrary of YAC YHV303-A6, total DNA of this yeast clone was 
used. After partial Eco RI digestion and preparative pulsed- 
field gel electrophoresis, DNA fragments in the size range of 
50 kb were recovered and subcloned in the pECSBAC4 vector. 
Clones carrying YHV303-A6 derived inserts were identified by a 
two-step colony hybridization procedure. First total labelled 
DNA of the non -recombinant yeast strain AB 13 8 0 was used as a 
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derived from the host strain. In a subsequent hybridization 
step the remaining clones were probed with the labelled 
recombinant chromosome YHV303-A6 after enrichment by 
5 preparative pulsed- field gel electrophoresis. 

DNA Sequencing of BAC F15 

DNA of BAC F15 was isolated by an alkaline lysis large 
scale plasmid preparation according to Sambrook et al . (1989). 

10 50 /xg of purified DNA were nebulized by high pressure treatment 
with argon gas in a reaction chamber for 150 seconds. The ends 
of the sheared and reprecipitated DNA were blunt -ended by a T4 
DNA polymerase -mediated fill in reaction. DNA fragments in the 
size range between 8 00 bp and 3 kb were isolated from agarose 

15 gels using a DNA isolation kit (Jetsorb, Genomed Inc., U.S.A.), 
subcloned into the pBluescript SK vector (Stratagene) and 
propagated in E. coll DN5a. Clones carrying BAC F15 derived 
inserts were selected by hybridization using the sheared DNA of 
BAC F15 as a probe. Sequencing reactions were performed as 

20 described above. Evaluation of the sequencing data, 

construction of sequence contigs, and estimation of coding 
propabilities were done by means of the STADEN software package 
for Unix users (4th edition, 1994) . Assessment of coding 
probabilities was based on a combined evaluation of uneven 

25 positional base frequencies, positional base preference and 
barley codon usage in the investigated contigs. Homology 
searches were done using the BLAST software . 
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PCR'hased Sequencing of Alleles at Mlo 

Plant chromosomal DNA for this purpose was isolated 
according to Chunwongse et al. (1993). DNA sequences of Mlo 
alleles of the different barley varieties, mlo mutants, BC 
lines, and intragenic recombinants used in this study were 
obtained by PCR-based sequencing- Seven overlapping 
subfragments of the gene (each 400 bp-600 bp in length) were 
amplified by PGR (35 cycles, 60'C annealing temperature) using 
sets of specific primers. After preparative agarose gel 
electrophoresis and isolation of the amplification products 
using the Jetsorb kit (Genomed Inc., U.S.A.) fragments were 
reamplified to increase specificity. The resulting products 
were subsequently purified from nucleotides and 

oligonucleotides (Jetpure, Genomed Inc., U.S.A.) and used as a 
template in DNA sequencing reactions (see above) . All DNA 
sequences of mutant alleles and corresponding regions of the 
parental lines and the intragenic recombinants were derived 
from both strands and confirmed two times in independent sets 
of experiments. In addition, mutant alleles nilo-1, n7lo-3, mlo- 
4, inio-5, inlo-7, mlo- 8, Jiilo-9, and mlo- 10 were also verified in 
the corresponding BC lines in cultivar Ingrid. 

RT-PCR and Rapid Amplification of cDNA Ends (RACE) 

RT-PCR was performed using the SUPERSCRIPT 
preamplif ication system for first strand cDNA synthesis (Gibco 
BRL) . Total RNA (1 fig) of seven-day-old primary barley leaves 
(cultivar Ingrid) served as template. First strand cDNA 
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synthesis was primed by an oligoCdT) primer, The putative 
coding region of the Mlo gene was subsequently amplified using 
oligonucleotides 25L (GTGCATCTGCGTGTGCGTA) and 3 8 
(CAGAAACTTGTCTCATCCCTG) in a single amplification step (35 
5 cycles, 60 *C annealing temperature). The resulting product was 
analyzed by direct sequencing. 5'- and iJ'-ends of the Mlo cDNA 
were determined by RACE (Frohman et al . , 1988) using the 
MARATHON cDNA amplification kit (Clontech) . Corresponding 
experimental procedures were mainly carried out according to 

10 the instructions of the manufacturer. To obtain specific RACE 
products, two consecutive rounds of amplification (35 cycles, 
55 'C annealing temperature) were necessary. For this purpose, 
two sets of nestled primers were used in combination with the 
adapter primers of the kit: oligonucleotides 4 6 

15 (AGGGTCAGGATCGCCAC) and 5 5 ( TTGTGGAGGCCGTGTTCC ) for the 5' -end 
and primers 3 3 (TGCAGCTATATGACCTTCCCCCTC) and 3 7 
(GGACATGCTGATGGCTCAGA) for the 3' -end. RACE products were 
subcloned into pBluescript SK (Stratagene) . Ten 5' -end and 
eight 3 ' end clones were chosen for DNA sequence analysis . 

20 



The term ^"AFLPs" is used herein to refer to ^^AFLP 
markers" . 
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Table 1 summarizes the identified mutation sites of 
various mutants within the Mlo gene. The origin, the mutagen 
and the predicted effect of the mutation at the amino acid 
level are indicated. 

Table 2 shows the results of heteroallelic mlo crosses and 
selfings of the respective mlo lines to isolate intragenic 
recombination events. 

Table 3 summarizes the grenotypes at flanking RFLP markers 
in susceptible or homozygous F3 progeny from the intermutant 
crosses . CO and NCO indicate crossover type and non crossover 
type recombinants deduced from flanking molecular marker 
exchange. Table 3 summarizes DNA sequence analysis of 
suceptible intragenic crossover type recombinants (from 
homossygous susceptible F3 progeny) and the corresponding 
parental mlo mutant lines. Sequences flanking the identified 
mutation sites are shown. 

Table 4 shows the results of direct PGR sequencing of 
genomic DNA of susceptible intragenic recombinants derived from 
both heteroallelic crosses mlo-1 x mlo-S and mla-l x 
•revealing restoration of wild type sequences . 

Table 5 shows several Arabldopais thalianat and two rice 
expressed sequence tags (ESTs) with homology to the Mlo 
protein. 

Table 5A show amino acid sequences, with "query" 
indicating part of the Mlo protein sequence to which homology 
has been found, with the predicted amino acid sequence of each 
identified EST marked with "subject". 
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Table 5B shows EST nucleotide sequences encoding the amino 
acid sequences shown in Table 5A, GenBank Accession number 
T22145 (definition 4153 Arabidopsis thaliana cDNA clone 97N8T7, 
NCBI Seq ID 932185) , number T22146 (definition 4153 Arabidopsis 
5 thaliana cDNA clone 97N9T7, NCBI Seq ID 932186), number N37544 
(definition 18771 Arabidopsis thaliana cDNA clone 205N12T7, 
NCBI Seq ID 1158686) , number T88073 (definition 11769 
Arabidopsis thaliana cDNA clone 155I23T7, NCBI Seq ID 935932) 
number H76041 (definition 17746 Arabidopsis thaliana cDNA clone 

10 193P6T7, NCBI seq ID 1053292), number D24287 (rice cDNA partial 
sequence R1638_1A, nID g428139) and D24131 (rice cDNA partial 
sec[uence R14 08_1A, nID g4 27985) are shown. The Arabidopsis 
sequences are from Newman et al . (1994) Plant PhyBXol. 106 
1241-55. The rice sequences are from Minobe, Y. and Sasaki, T, 

15 submitted 2 Nov 1993 to DDBJ. 
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Table 2 



F2 progeny from intermutant crosses and selflngs 



Testcrosses 


resistant 


susceptible 


frequency of 








susceptible F2 progeny 


mlo-8 X mlo-1 


5.281 


3 


5.7x10-4 


mlo-5 X mlo-1 


915 


0 




mlo-5 X mlo-1 


14.474 


9 


6.2 X 10-4 


selfings 


resistant 


susceptible 




mlo-1 


12.634 


0 




mlo-5 


5.498 


0 




mlo-6 


8,435 


0 
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TABLB 5A 



>EM EST1:AT1452 T22145 41S3 Arat>l<lopsls thaXlana cONA clone 97NeT7 . 11/95 
Length - 382 

Plus Strand HSPs: 

Score « 248 (115.9 bits). Expect - 2.90-27^ P - 2.9e-27 
Identities -= 47/100 (47%)^ Positives « 67/100 (67%)^ Frame - +2 

Query: 242 KriKRSMEDDFianWGISI*PIiWGVAXI-TLFLDINGVGTLIWlSFIPLVXI.LCVGTKLEMI 301 

KY-f* R4*+EDDFK WGIS LW -M- L-h-WG T WI+FIP +I,L VGTKI.B + 
Sbjct: 2 KXl^^IRfOjEODFKQVVGXSWYhmCFVVXFXLLir^ 181 

Query: 302 IMEMALEIQDRASVIKGAPVVEPSNKFFWFHRPDWVIiFFI 341 

I ++A E+ -M- I+G W+P . + FWF 4-P VIH- I 
Sbjct: 182 IAQLAHEVAEKHVAIEGDI*VVKPXXEKFWFSKPQIVLyi.I 301 



>EM EST1:AT1462 T22146 4154 Arabidopsis thaliana CDNA clone 97N9T7. 11/95 
Lengtti - 390 

Plus Strand HSPs: 

Score - 212 (99.1 bits). Expect - ^ -^Kl^/a^^'^aV^KZ^^L^t'lt 
Identities - 41/83 (49%) r Positives - 58/83 (69%). Frame - +2 

Queryi 242 K«:F»SMEDDFKVVVGISLPLWGVAILTI.FU)IMGVGTLI^ 301 
vuei^. jv * • ^ WGIS I*W ++ L X-++NG T WI+FIP +I.L VGTKI* + 

Sbjct: 2 ^SlMRALmDE^^ 

Query: 302 IMBMAI£IQDRASVIKGAPWBP 324 

I ++A B+ ++ I-K3 W+P 
Sbjct: 182 IAQIAHEVAEKKVAIE(2>I.WKP 250 

score - 52 (24.3 bits). Expect - 3^-^' f"?,^ «i ' ^iH^ « « 
Identities - 9/32 (28%) , Positives - 16/32 (50%) r Frame - +2 

Query: 18 wavawfaakvlvsvlmbhglhklgkwpqhrh 49 

K FA ++ V «H +!• H +H 

Sbjct: 122 faAF»FAXJ.IAVGTfaC£HVIAQXJUIEVABra 217 



Spore - 49 (22.9 bits) r Expect - 4.2e-26r Sum P(2) - 4.2e-26 
Identities - 8/17 (47%), Positives - 12/17 (70%), Frame - +1 

Query: 323 EPSNKPFWFHRPDWVLF 339 

E S-I-+ EMF +P VL+ 
Sbjct: 244 BTSDEHFKFSKPQXVLY 294 
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TABLE 5A cont^d 



>EK BST1:AT54418 K37S44 16771 Arabi.dLopsi.3 t ha liana cDNA clone 20SH12T7. 1/96 
Length 585 

PXus Strand HSPs: 

Score - 277 (129.5 bits). Expect - 1.2e-45r Sum P(2) •= 1.2e-45 
Zdentltlee - 51/96 (53%) , Positives - 71/96 (73%) r Frame - +1 

Query: 23e SKPDFHKrZKRSMEDDPKVVVGISLPLWGVAfrLTLFLDZKC^GTLZfaSF^ 295 

S-fFDF KTI-I-RS+E DPK W.IS +W VA+L L + (3+ + +«+ FZPLV<M*L VG 
SbjCt: 127 SBFOFRKYIQRSLBIOJFKTVVfelSPVIWFVAVLFLLTNSYGLRSYLKLPPIPLVVILIVG 306 

Query: 296 TKI»EMIIMEMALEIQDRASVIKGAPWBPSKKFFWF 331 

TKLEH-ZX -M- L IQ+ V++GAPVV+P + FWF 
SbjCt: 307 TKLEVIZTKLGLRZQBBGDWRGAPWQPa^DXFHF 414 

Score - 121 (56.6 bits). Expect - 1.2e-4S, Sum P(2) l«2e-45 
Identities - 25/45 (55%), Positives » 29/45 (64%), Frame - -l-l 

Query: 196 SSTPGZRHWAFFRQFFRSyTKVDYLTLRAGFXMAHLSpNSKFbF 240. 

S T HH-V.FFRQFP SVTKVDVI. L (;PI AH -I- 4+ F 
Sbjct: 1 SKTRVTI.WZVCFPRQFFGSVTKVDrLAI0CH6PZMAHFAPGNBSRF 135 



>EH EST1:AX04X17 1176041 17746 Arabldopsls thaldana cDNA clone 193P6T7. 11/95 
Length - 476 

Plus Strand HSPs: 
Score - 210 (90.2 bits). Expect - 9.0e-36, Sum P(2) - 9.0e-36 
Identities «- 43/86 (50%), Positives -^ 58/86 (67%), Frame - <«*1 

Query: 196 SSTPGXRf^AFFRQFFI^VTKVDYLTLIlAGFXNAHLSQNSiCFDFHKyXlCRSMBD 255 

++TP V FFRQFF SV + DYLTLR GF -fAKL^- KF+F +YZK S+EDDPK+V 
Sbjct: 124 TTTPFXFKVGCFFRQFFVSVERTDrLTLtmGFXSAHIAPGRKFM^ 303 

Query: 256 VGZSLPLifGVAZLTLFLDZHGVGtLZ 281 

VGZ LW L + /fGT-M- 

Sbjct: 304 VGZXPVLWASFVZFLAVQX*WLGTZV 381 

Score - 119 (55.6 bits). Expect - 9.0e-36, Sum P(2) - 9.0e-36 
Identities - 24/57 (42%), Positives - 32/57 (56%), Frame - 4*1 

Query: 156 MRTfmKWETBTTSIXYQFANDPARFRFTHQTSFVKRRZ^LSSTPGZRWV^^ 212 

HKKHB T S +Y F D +R R TH+TSFV+ H -l-T + V F + F 
Sbjct: 1 ZIU5HKlCfmQXTX.Sin>YXFXZDHSIUiRX«THETSFVRBHTSFinrTTPFXFHV^^ 171 



Score - 40 (18.7 bits). Expect - 1.2e-08, Sum P(2) - l.2e-08 
Identities - 8/19 (42%) , Positives - 10/19 (52%) , Frame « 4-2 

Query.; 269 TWLDZNGVGTLrWISFIP 287 
<f KG G L H S P 

Sbjct: 344 SLl«FNXHGHGPI#PHASVPP 400 
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TABLE 5A cont'd 



>BM ESTI:AT0739 T88073 11769 Arabititopsls challana cDNA clone IS5I23T7. 11/9S 
Length •460 

Plus Strand H8Ps : 

Score - 175 <81,e bits) # Eaq>ect « 1.2e-*24, Sum P<2) - 1.2e-24 
Xctentitles - 31/67 (46%) r Positives ■» 43/67 (64*)/ rrame - +1 

Query; 146 VITIAI^IU:.iamTWKIC*irBTETTSLEYQFAroPiUirRFTHQTSFy^^ 205 

•I-+T A 4^fCMRTHK WE ET ++EXQ-f-fr-NDP RFRF TSF 4-RIU. S 4- + 
Sbjct: 4 XVTYAroKIKMRTWKSWEEETKTIEYQYSNDPERPRFAIU>TSFGIUlHLIi™ 183 

Score - 121 (56.6 bits). Expect - 1.4e-14, Sum P(2) - 1.4e-14 
Klentltles « 25/45 (55%) , Positives 29/45 (64%) , Fraxae - +1 

Query: 196 SSTPGIRWWAFFRQFFRSVTKVDYLTLRAGPIMAHI^SQNSKFDF 240 

S T WW FFRQFF SVTKVDYL L GFI AH + ++ F 
Sbjct: 157 SKTRVTLWIVCFFRQFFC5SVTKVDYLALXHGFXKAHPAPGNESRF 291 

Score - 75 (35.1 bits). Expect - 1.2e-24, Sum P(2) - 1.2e-24 
Identities « 14/21 (66%), Positives - 17/21 (80%), Frame - +1 

Query: 236 SKFDFHKYXKRSMEDDFKVW 256 

S4-FDF ICYX4-RS4- DPK W 
Sbjct: 283 SRFDFRKYXQRSZOOCDFICTVV 345 



>EM ESTS:OSR16381A D24287 Rice cDNA, partial sequence (R1638_1A) . 5/95 
X^engrth • 400 
Plus Strand HSPs: 

Score - 147 (68.7 bits). Expect l.9e-16. Sum P(2) - 1.9e-16 
Identities « 26/53 (49%), Positives - 35/53 (66%), Frame - +1 

Query: 236 SKFDFHKYIKRSMEDDFKVVVGISLPI.WGVAILTLFtDIllGVGTLlWISFIPI. 288 

++F+F KYXfCR -I^EDDFK WGIS P W A+ + +++G L W S PL 
Sbjct: 202 TRPMFRKYIKRXLEDDFKTVVGISAPaCWASAIAIMLPNV^ 3W 

Score - 45 (21.0 bit:8) , Baq;>ect - 1.9e-16, Sum P(2) - 1.9e-l6 
Identities - 9/15 (60%), Positives - 11/15 (73%), Frame - +2 

Query: 287 PLVXLLCVCTKLBMI 301 

PL 4- L VGTKL+ X 
Sbjct: 356 PLXVTLAVC^TKLQAI 400 



>EM EST5:OSS1692A D39989 Rlce cDI<IA, partial sequence (S1692_1A) • 11/94 
Ijength -343 

Plus Strand HSPs: 
Score - 95 (44.4 bles) , Expect - 0.00059, P - 0.00059 
Identities - 24/58 (41%), Positives - 31/58 (53%), Frame - +3 



Query: 43 MWFQHRHKKALWEALEKMKAELMLVGFISLLLIVTQDPIIAKICISEDAADVHWPCKR 100 

H 4^ H4- L 4-A-l-EKMK B+ML-KJFISLLL T I S+ PC R 

Sbjct: 3 HXSEKTHRMPLHKAMEKMKBEMMLLGFISLLLAATSRIISGICIDSKYYNSNFSPCTR 176 
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TABLE SB 



1 00 



GenBank AccesBlon Humber T22145 



1 caag^a^atg 
61 tctttggntc 
121 ctggatagca 
181 nattgcacag 
241 ggtgaaaccc 
301 ccca^ttitat: 
361 ggggtiaanan 



atigcgcgctic 
t:t:tgtcgtca 
tttattccct 
t^agctcatg 
ncanabgagc 
ccticti^ticcc 
annggtiti^cg 



tagaggatga 
tctttttnct 
ttnctttgct 
aagttgcaga 
atttctggtt 
agaatigcntt 
nc 



t^tcaaacaa 
gct:aaat:gtt 
t:ct:1;gct:gt:g 
gaaacatrgta 
cagcaaacct 
t:t:nagant;gc 



gtt:gt:t:ggta 
aacggatggc 
ggaacaaagt 
gccattrgaag 
caaattgtt:c 
nttttttnnt 



tt:agt:^ggta 
acacatattt 
tggagcatgt 
gagacttagt 
tctacttgat 
tttggnnttt 



GenBank Accession Number T221A6 



1 caagt:atatg 
61 tct:t:t:ggntc 
121 ctggatagca 
181 nat:t:gcacag 
241 ggt:gaaacct 
301 cnct:t:t;atcc 
361 ttgggnnnnc 



at:gcgcgct:c 
t:t:tgtcgtca 
t^^attccct 
t:t:agctcatg 
cagat:gagca 
ccct:t:ccaga 
aaacgggntt 



^agaggatga 
tctttttgct 
ttgctttgct 
aagtt:gcaga 
tttctggttc 
at:gcctt:1:tt: 
nggacclzccg 



tt:t:caaacaa 
gctaaatigtt: 
tcttgctgtg 
gaaacat:gt:a 
agcaaacctc 
nangatccnn 



gttg^tggta 
aacgga^ggc 
ggaacaaagt 
gccattgaag 
aaantgttct: 
ntttttcctt 



ttagt:t:ggta 
acacat:attt 
tggagcatgt 
gagacttagt: 
ctactogatc 
ntt:ggannt;t 



GenBank Accession Nuinber N3754A 



1 agcaagacga 
61 accaaag^tg 
121 aacgaat:caa 
181 accgti^gttg 
241 tcat:atggat 
301 gtt:ggaacaa 
361 gatgtggtga 
421 cacgnt:^caa 
481 ctt:tncc^gg 
541 cataggc^tt 



gag^cacact 
at:tactt:agc 
gat:t:cgattt 
aaat:cag^cc 
tacgt^ct^a 
agc^tgaagt 
gaggcgcccc 
tnttttccnt 
ggncggatga 
nggtgggntt 



atggattgtt 
actaagncat 
ccgcaagtat 
ggttatctgg 
cctctggtta 
cataataaca 
agtggttcag 
antcacttng 
ttcaatccaa 
iLtcaganttt 



tgttttttta 
ggttitcatca 
attcagagat 
tttgtcgctg 
ccat1:ca^tc 
aaattgggtc 
cctggtgatg 
gcctttt^an 
naatnttccc 
nagtttggct 



gacagttctt 
tggcgcattt 
cattagagaa 
tgctattcct: 
cactagtcgl: 
taaggatcca 
accncttc^g 
gggtgaatt:t 
ctgaagnct:n 
tnccc 



tggat:ctgtc 
tgctcccggt 
agac^tcaaa 
ct^gaccaat 
aat:^ctaat:a 
agaggaaggt 
gtttngnaan 
caact:t:cat:n 
caag^^t:ggg 



SUBSTITUTE SHEET (RULE 26) 



wo 98/04586 PCT/GB97/02a46 

101 

TABLE 5B (Continued) 



GenBank Accession Number T88073 

1 tgcattgtta cttatgcttt cggaaagatc 

61 gagacaaaga caa^agagta tcagta^^cc 

12 X gacac^^ctrt: tt:gggagaa9 aca^ct:caat: 

181 attgtttgtt tttttagaca gttctttgga 

241 agncatggtt tcatcatggc gcattttgct 

301 aagtatattc agagaticatt agngnaagac 

361 tatctggttt gtcggctgtg ctattccnct 

421 tggtaccatt at:t:cnctagc ggaatntiaaa 



aagatgagga cgtggaagtc gtgggaggaa 

aacgatcct:g agaggt:tcag gtttgcnagg 

ttctggagca agacgagagt cacactatgg 

tctgtcacca aagttgatta cttagcacta 

cccggtaacg aatcaagatt cgat:t:t:ccgc 

ttcaaaaccg ttgtttgaaa tcagtccggt 

tgaccaat:t:c atatggntnc ggt:nt:tncnc 
agttggcnga 



GenBank Accession Number H76041 



1 attcgtiggat: ggaaaaagtg ggagcaagan 

61 gatcat^caa gact^tiaggct cactca^gag 

121 tggacaacaa cncc^t^ctin ct:^l:aacgt:c 

161 gtngaaagaa ccgactiactt gactctgcgc 

241 ggaagaaagt: t:caac1:1:cca gagatatatc 

301 gtagttggaa taagnccagt tctttgggca 

361 t:aa^ggc^gg ggaccat:t:g^ tttgggcntc 

421 tt:ggccaagg ^tcaaggaat t:tngggacaa 



acattatcta atgactatna gtttnctatt 

acttcttttg tnagagaaca tacaagtt:tc 

ggatgcttct ttaggcagtt ctttgtatct 

cat:ggat:t:ca nct;ct:gccca tCtiagctcca 

aaanga^c^c tcgaggat^ga t^l;caagt:t:g 

tcattl;gt;aa tcttccttgc tgt:tcaat:gn 

ggtaccgcct ntiactcanaa ncccaggctt 

tggggtagaa tcgtgggcnc atnngg 



GenBank Accession Number D24287 

1 tcntntttnn ttttcgnntn cntccacccc 
61 tntitn^totc ncntntcccn ncaccaccnn 
121 aggctgccca ctgncgtctg agacctacct 
181 tgct:cact:tt atctctacgg gacteggttc 
241 gaggacgatit ^taagacagt tgtt:ggcatt 
301 att:at:gct:ct tcaatgttca tggatggcat 
361 gnt:agt:aact ttagcagt^g gaacaaagct 



tnnnntmctc nancncnt:tn nnnttatctc 
ncgacgggcn tggactnngc ccnnngttcg 
tgncattt:ga cggcacngga cttcanttgc 
aattt:tcgga aatacatcaa aaggncactg 
agtgcacccn 1:aegggct:t:c tgcgt:^ggcc 
aaottgttct- ggttctc^ac aatncccctt 
gcaggcta^a 



GenBank Accession Number D24131 

1 cagactacct gBCt:tt:gagg cacggattca 

61 tcaa^^^^cg gaaatacatrc aaaaggt^cac 

121 ttagligcacc ct:t:atgggct; t:ct:gcg^t>gg 

181 a^aact:t:gt:t: c^ggttct:Ct: acaatccccc 

241 tgcaggctiat: aat:t:gcaat:g atggctgttig 

301 gaatgccggt ggtgaactca gtgat 



tt:gctgct:ca tttatctct:a gggact:aggt 
tggaggacga ttitrtaagaca gttgt:t:ggca 
ccatt:atgct: ctt:naa^gt:t catgga^ggc 
ttgtagt:aac tttagcagfct ggaacaaagc 
aaat:fcaaaga gaggcat;aca gt:aa1:^caag 
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CLAIMS ; 

1. An isolated polynucleotide encoding a polypeptide which 
includes the amino acid sequence shown in Figure 2 . 

5 

2. A polynucleotide according to claim 1 wherein the coding 
sequence is the coding sequence shown in Figure 2. 

3. A polynucleotide according to claim 1 wherein the coding 
10 sequence is a mutant, allele, variant or derivative of the 

coding sequence shown in Figure 2, by way of addition, 
deletion, substitution and/or insertion of one or more 
nucleotides . 



15 4. TVn isolated polynucleotide which on expression in a 
transgenic plant exerts a negative regulatory effect on a 
pathogen defence response of the plant, which defence response 
is pathogen independent and autonomous of the presence of 
pathogen, the polynucleotide encoding a polypeptide which 

2 0 includes an amino acid sequence which is a mutant, allele, 
variant or derivative of the Barley Mlo sequence shown in 
Figure 2, or is a homologue of another species or a mutant, 
allele, variant or derivative thereof, the amino acid sequence 
differing from that shown in Figure 2 by way of addition, 

25 substitution, deletion and/or insertion of one or more amino 
* acids . 
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5 . A polynucleotide according to claim 4 encoding a 
polypeptide which includes the amino acid sequence shown in 
Figure 13 . 

5 6 . A polynucleotide according to claim 5 wherein the coding 
sequence is that shown in Figure 10. 

7. A polynucleotide according to claim 5 wherein the coding 
sequence is a mutant, allele, variant or derivative of the 
10 coding sequence shown in Figure 10, by way of addition, 
deletion, substitution and/or insertion of one or more 
nucleotides . 

8 . A polynucleotide according to claim 4 encoding a 
15 polypeptide which includes the amino acid sequence shown in 
Figure 14 . 

9 • A polynucleotide according to claim 8 wherein the coding 
sequence is that shown in Figure 11 . 

20 

10. A polynucleotide according to claim 8 wherein the coding 
sequence is a mutant, allele, variant or derivative of the 
coding sequence shown in Figure 11, by way of addition, 
deletion, substitution and/or insertion of one or more 

25 nucleotides. 

11. A polynucleotide according to claim 4 encoding a 
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polypeptide which includes the amino acid sequence shown in 
Figure 15. 

12 . A polynucleotide according to claim 11 wherein the coding 
sequence is that shdwn in Figure 12. 

13. A polynucleotide according to claim 11 wherein the coding 
sequence is a mutant, allele, variant or derivative of the 
coding sequence shown in Figure 12, by way of addition, 
deletion, substitution and/or insertion of one or more 
nucleotides . 

14- A polynucleotide according to any preceding claim bperably 
linked to a regulatory sequence for expression. 

15 . An isolated polynucleotide encoding a polypeptide which on 
expression in a transgenic plant produces a polypeptide which 
can stimulate or maintain a defence response of the plant, the 
encoded polypeptide including an amino acid sequence which is a 
mutant, allele, variant or derivative of the Barley Mlo 
sequence shown in Figure 2 or of a homologue of another 
species, the amino acid sequence differing from that shown in 
Figure 2 by way of addition, substitution, deletion and/or 
insertion of one or more amino acids.. 

16. A polynucleotide according to claim 15 which stimulates or 
maintains said defence response of the plant on homozygous 
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expression in the plant. 

17. A polynucleotide according to claim 15 wherein the amino 
acid sequence includes an alteration identified in Table 1. 

18. A polynucleotide according to claim 17 wherein the amino 
acid sequence is that of Figure 2 including a substitution at 
residue 24 0. 

19. A polynucleotide according to claim 17 wherein the amino 
acid sequence includes Leucine at residue 240. 

20. A polynucleotide according to any of claims 15 to 19 
operably linked to a regulatory sequence for expression. 

21. An isolated polynucleotide which has at least about 600 
contiguous nucleotides of the nucleotide sequence of any of 
claims 1 to 13 or complement thereof 

22. A polynucleotide according to claim 21 operably linked to 
a regulatory sequence for transcription. 

23 . An isolated polynucleotide which has at least about 300 
contiguous nucleotides of the sequence of any of claims 1 to 
13, or complement thereof, operaibly linked to a regulatory 
sequence for transcription. 
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24. A polynucleotide according to claim 22 or claim 23 wherein 
the regulatory sequence includes an inducible promoter, 

25. A nucleic acid vector suitable for transformation of a 
5 host cell and including a polynucleotide according to any 

preceding claim. 

26. A nucleic acid vector according to claim 25 wherein said 
host cell is a microbial cell. 

10 

27. A nucleic acid vector according to claim 25 wherein said 
host cell is a plant cell . 

28. A host cell containing a heterologous polynucleotide or 
15 nucleic acid vector according to any preceding claim. 

29. A cell according to claim 28 which is microbial. 

30. A cell according to claim 28 which is a plant ciell. 

20 

31. A cell according to claim 30 having said heterologous 
polynucleotide incorporated within its genome. 

32- A cell according to claim 31 having more than one said 
25 polynucleotide per haploid genome- 



33. A cell according to any of claims 30 to 32 which is 
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comprised in a plant . 

34. A plant including a cell according to any of claims 30 to 
32. 

5 

35. A plant which is a sexually or asexually propagated off- 
spring, clone or descendant of a plant according to claim 34, 
or any part or propagule of said plant, off -spring, clone or 
descendant . 

10 

36. A part or propagule of a plant according to claim 35. 

37. A plant according to claim 34 which does not breed true. 

15 38. A method of producing a plant, the method including 

incorporating a heterologous polynucleotide according to any of 
claims 1 to 14 into a plant cell and regenerating, a plant from 
said plant cell. 

20 39. A method of producing a plant, the method including 

incorporating a heterologous polynucleotide according to any of 
claims 15 to 20 into a plant cell and regenerating a plant from 
said plant cell . 

25 40. A method of producing a plant, the method including 

incorporating a heterologous polynucleotide according to any of 
claims 21 to 24 into a plant cell and regenerating a plant from 
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said plant cell. 

41. A method according to any of claims 38 to 40 including 
sexually or asexual ly propagating or growing off -spring or a 

5 descendant of said plant . 

42. A method of stimulating a defence response in a plant, the 
method including causing or allowing transcription from a 
heterologous polynucleotide according to any of claims 1 to 14 

10 within cells of the plant. 

43. A method of stimulating a defence response in a plant, the 
method including causing or allowing transcription from a 
heterologous polynucleotide according to any of claims 15 to 20 

15 within cells of the plant. 

44. A method of stimulating a defence response in a plant, the 
method including causing or allowing transcription from a 
heterologous polynucleotide according to any of claims 21 to 24 

20 within cells of the plant . 

45. A method of producing a polynucleotide encoding a 
polypeptide which on expression in a transgenic plant produces 
a polypeptide which can stimulate or maintain a defence 

25 response of the plant, the method including alteration of the 
nucleotide sequence of a polynucleotide according to any of 
claims 1 to 14 . 
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46, A method according to claim 45 involving site-specific 
sequence mut at ion . 

47, A method according to claim 45 involving intracellular 
5 homologous recombination, 

48. A method wherein following alteration of a nucleotide 
sequence in accordance with the method of claim 45 a 
polynucleotide including the altered nucleotide sequence is 

10 introduced into a host cell. 

49. A method according to claim 48 wherein the host cell is a 
plant cell, 

15 50. A method wherein following introduction of a 

polynucleotide into a plant cell in accordance with claim 4 9 a 
plant is regenerated from the cell or descendants thereof 
including the altered nucleotide sequence. 

20 51. Use of a polynucleotide according to any of claims 1 to 14 
for stimulating a defence response in a plant. 

52. Use of a polynucleotide according to any of claims 15 to 
20 for stimulating a defence response in a plant. 

25 

53 . Use of a polynucleotide according to any of claims 21 to 
24 for stimulating a defence response in a plant. 



wo 98/04586 



PCT/GB97/02046 



116 

54 . Use of a polynucleotide according to any of claims 21 to 
24 for down -regulation of expression of a gene encoded a 
polypeptide encoded by a polynucleotide according to any of 
claims 1 to 14 . 

5 

55 . Use of a polynucleotide according to any of claims 1 to 14 
in the production of a transgenic plant . 

56 . Use of a polynucleotide according to any of claims 15 to 
10 20 in the production of a transgenic plant. 

57. Use of a polynucleotide according to any of claims 21 to 
24 in the production of a transgenic plant. 

15 58. A method of determining the presence of a pathogen 

resistance or susceptibility allele in a plant or plant cell, 
the method including analysing a sample from the plant or plant 
cell by: 

(a) comparing the sequence of nucleic acid in the sample 
20 with all or part of the nucleotide sequence shown in Figure 7 

to determine whether the sample from the patient contains a 
mutation; 

(b) determining the presence in the sample of a 
polypeptide including the amino acid sequence shown in Figure 7 

25 or a fragment thereof and, if present, determining whether the 
polypeptide is full length, and/or is mutated, and/or is 
expressed at the normal level; 
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(c) performing DNA fingerprinting to compare the 
restriction pattern produced when a restriction enzyme cuts 
nucleic acid in the sample with the restriction pattern 
obtained from the nucleotide sequence shown in Figure 7 or from 

5 a known mutant, allele or variant thereof; 

(d) contacting the sample with a specific binding member 
capable of binding to nucleic acid including the nucleotide 
sequence as set out in Figure 7 or a fragment thereof, or a 
mutant, allele or variant thereof, the specific binding member 

10 including nucleic acid hybridisable with the sequence of Figure 
7 or a polypeptide including a binding domain with specificity 
for nucleic acid including the sequence of Figure 7 or the 
polypeptide encoded by it, or a mutated form thereof, and 
determining binding of the specific binding member ; 

15 (e) performing PGR involving one or more primers based on 

the nucleotide sequence shown in Figure 7 to screen the sample 
for nucleic acid including the nucleotide sequence of Figure 7 
or a mutant, allele or variant thereof, 

2 0 59. A method of determining the presence of target nucleic 

acid in a plant or plant cell, the method including contacting 
a nucleic acid molecule which includes the nucleotide sequence 
shown in Figure 7 or an oligonucleotide fragment thereof with 
nucleic acid in a sample from the plant or plant cell and 

25 assessing hybridisation of said nucleic acid molecule with 
nucleic acid in the sample. 
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60. A method according to claim 59 which involves 
amplification of nucleic acid to which said nucleic acid 
molecule hybridises . 

61. A method according to claim 59 or claim 60 wherein said 
nucleic acid molecule includes an alteration in sequence 
compared with the nucleotide sequence shown in Figure 7 or 
corresponding fragment thereof . 

62. A method according to claim 61 wherein said alteration is 
selected from those shown in Table 1. 

63. An assay method for identifying a compound able to bind 
the • polypeptide encoded by the polynucleotide of any of claims 
1 to 14 or any of claims 15 to 20, the method including: 

(a) bringing into contact said polypeptide or a fragment 
thereof, and a test compound; and 

(b) determining interaction or binding between said 
polypeptide or fragment thereof and the test compound. 

64 . An assay method according to claim 63 wherein a compound 
is identified which is able to bind the polypeptide for which 
the amino acid sequence is shown in Figure 2 . 

65. An assay method for identifying a compound able to 
stimulate a defence response in a plant by interaction with the 
polypeptide encoded by the polynucleotide of any of claims 1 to 
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14 or any of claims 15 to 20, the method including: 

(a) contacting a plant or plant part with a test compound and 
determining stimulation of a defence response; and 

(b) bringing into contact said polypeptide or a fragment 
thereof with a test compound and determining interaction or 
binding between said polypeptide or a fragment thereof and the 
test compound; 

step (b) being performed with a test compound which tests 
positive in step (a) , or step (a) being performed with a test 
compound which tests positive in step (b) , or steps (a) and (b) 
being performed in parallel . 

66. 7\n assay method according to claim 65 wherein stimulation 
of a defence response is determined by monitoring pathogen 
growth and/or viability on the plant or plant part. 

67. An assay method according to claim 65 or claim 66 wherein 
a compound is identified which is able to bind the polypeptide 
for which the amino acid sequence is shown in Figure 2. 

68. An assay method according to any of claims 65 to 67 
wherein a compound is identified which is able to stimulate 
resistance to powdery mildew in barley. 

69. A method which includes following identification of a 
compound as being able to stimulate a defence response in a 
plant in accordance with any of claims 65 to 68 formulation of 
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the compound, or optionally if the compound is peptidyl nucleic 
acid encoding it, into a composition including at least one 
additional component. 

70. A method which includes following identification of a 
compound as being able to stimulate a defence response in a 
plant in accordance with any of claims 56 to 58 application of 
the compound, or optionally if the compound is peptidyl nucleic 
acid encoding it, to a plant. 

71. Use of a polypeptide encoded by a polynucleotide according 
to any of claims 1 to 14, in screening for compounds able to 
stimulate a defence response in a plant.. 

72. Use of a polypeptide encoded by a polynucleotide according 
to any of claims 15 to 20, in screening for compounds able to 
stimulate a defence response in a plant. 

73. A compound able to stimulate a defence response in a plant 
identified by a method according to any of claims 63 to 68. 
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MS D K KGV PAR E L P ET P S WAV 
ATGTCGGACAAAAAAGGGGTGCCGGCGCGGGAGCTGCCGOAGACGCCGTCGTGGGCGGTG 60 

AVVFAAMVLVSVLMEHGLHK 
GCGGTGGTCTTCGCCGCCATGGTGCTCGTGTCCGTCCTC ATGG AACACGGCCTCC ACAAG 120 

LGHWFQHRHKKAbWEA LEKM 
CTCGGCCATTGGTTCCAGCACCGGCACAAGAAGGCCCTGTGGGAGGCGCTGGAGAAGATG 180 

KAELMLVGFISLLLIVTQDP 
AAGGCGG AGCTCATGCrrGGTGGGCTTCAT ATCCCTGCrCCTC ATCX;TC ACGC AGG ACX^ 240 

I lAKICISEDAADVMWPCKR 
ATCATCGCCAAGATATGCATCTCCG AGG ATGCCGCCG ACGTC ATGTGGCCCTGCAAGCGC 300 

GTEGRKPSKYVDYCPEGKVA 
GGCACXGAGGGCCGCAAGCCCAGCAAGTACGTTG ACT ACTGCCCXSG AGGGCAAGGTGGCG 3 60 

LMSTGSLKQLKVP IFVbAVF 
CTCATGTCCACGGGCAGCTTGCACCAGCTGCACGTCTTCATCTTCGTGCTCGCGGTCTTC 420 

HVTYSVITIALSRL KMRTWK 
CATGTCACCTACAGCGTCATCACCATAGCTCTAAGCCGTCTCA^SIaTGAGAACATGGAAG 4 80 

KWETETTSLEYQFANDPARF 
AAATGGG AG ACAG AGACC ACCTCCTTGG AAT ACC AGTTCGC AAATGATCCTGC ACGGTTC 5^0 

RFTHQTSrVKRH LGLS STPG 
CGGTTCACGCACCAGACGTCGTTCGTGAAGCGCCACCTGGGCCTCTCCAGCACCCCTGGC 600 

IRWVVAFFRQFFRSVTKVDY 
ATCAGATGGGTGGTGGCCTTCTTCAGGCAGTTCTTCAGGTCAGTCACCAAGGTGGACTAC 660 

L TLRAGFINAH LSQNSKFDF 
CTGACCTTGAGGGCAGGCTTCATCAACGCGC ATTTGTCGCAAAACAGCAAGTTCG ACTTC 720 

HKYIKRSMEDDFKVVV GISL 
CACAAGTACATCAAGAGGTCGATGGAGGACGACTTCAAGGTCGTCGTCGGCATCAGCCTC 780 

PLWGVAI LTLFLD INGVGTJU 
CCGCTGTGGGGTGTGGCG ATCCTC ACCCTCTTCCTTG ACATC AATGGGGTTGGC ACGCTC 640 

IWISFIPLVIL LCVGTKLEM 
ATCTGGATTTCTTTCATCCCTCTCGTGATCCTCTTGTGTGTTGGAACCAAGCTGGAGATG 900 

I I MEMALEIOD RASVI KGAP 
ATCATC ATGG AG ATGGCCCTGG AGATCCAGG ACCGGGCG AGCGTCATC AAGGGGGCCCCC 9 60 

VVEI>SHKFFWFMRPDWVLFF 
GTGGTCGAGCCCAGCAAC/iAGTTCTTCTGGTTCCACCGCCCCGACTGGGTCCTCTTCTTC 1020 

I H L T L F O N A F Q M A M F V v; T V A 
ATACACCTGACGTTGTTCCAGAACGCGTTTCAGATGGCGCATTTTGTGTGGACAGTGGCC 1080 

T P G L K K C Y H T Q I G I. S I M K V V 
ACGCCCGGCTTGAAGAAATGCTACCACACGCAGATCGGGCTGAGCATCATGAAGGTGGTG 1140 

VGLALQFLCSYMTFP LYALV 
GTGGGGCTAGCTCTCCAGTTCCTCTGCAGCTATATGACCTTCCCCCTCTACGCGCTCGTC 1200 

TQMGSNMKflSIFOEQTSKAL 

acacagatgggatcaaacatgaagaggtccatcttcgacgagcagacgtccaaggcgctc 1260 

TNWUNTAKEKKKVROTDMLM 
ACCAACTGGCGGAACACGGCCAAGGAGAAGAAGAAAGTCCGAGACACGGACATGCTGATG 1320 

O M r G O A T l> S R G S S P M P 3 R G 
GCTCAGATGATCGGCGACGCAACACCGAGCCCAGGC TCGTCGCCGATGCCGACCCCGGGC 1300 
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SSPVHLLH KGMGRSDDP'QSA 

TC ATC ACCCGTGC ACCTGCTTCAC AAGGGCATGGGGCGGTCGGACGACCCCCAG AGCGCG 14^0 

PTSPRTQQEARDMYPVVVAH 

CCCACCTCGCCAAGGACCCAGCAGGAGGCTAGGGACATGTACCCGGTTGTGGTGGCGCAC 1500 

PVHRI.NPNDRRRSASSSALE 

CCGGTGCACAGACTAAATCCTAACGACAGGAGGAGGTCCGCCTC6TCGTCGGCCCTCGAA 1560 



ADIPSADFSFSQG * 
GCCGACATCCCCAGTGCAGATTTTTCCTTCAGCCAGGGATGA 1602 
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292 mxMACCTC ATftfynfa^TOQfy?^ 341 

II :llll llltll IMIMICIItlllMIII Nil III I 11 

80 GCJmAGCTGATGCTGCTGGGCTTCIlTKTCCCTGCTTCTCAOC^ 129 

342 (yg/VCa:CATCATC(yyAAfiAXATfiCftTCT(X 391 

ti If nil iiii:ri Mini II M II III II n 

130 GGCX^OC. . «CATCTCCAANATCTGCATCCCX:AA6TC6GC^ 176 

392 TCTGGCrr!T<y^RRrU-;fV^fyyrArraAGa^ . AAGCCCAGCAAGTACnT 440 

III III IIIMI 111:11111111 :l n I n 
177 TGTTGCC6TGCAAGGCAGGCCNAGATGCCATOGAAGAANAAGCAGCAAGT 226 

441 TCArTAfr rC^rnf^nAO CTGAC^AGCAGAGCCCGGACCAGC^ 490 

I I : I : I n n II I I I I n i i ii i n 

227 GGTCKCCMGTCC.TTGGCCGGCGCCGGCGGCGGGGACTACTGCTCNAAAT 275 

491 TGATGAAGAAATCAATACC ! . . I .GAACTTTTTCTTGTTTTCT 528 

I If I n : n Iff It : : : 

276 TCGATGTGAGAATAACNCCAGCTGCCGGCAAGCACAACCTCGATNCNATN 325 

529 TCTGATTGTCGTCTTGGCTTGGCTTAATTGGTGTGTGTGTGTGTGTTTGC 578 

11:111 I n n 1 1 I I I I I I I n f 

326 ACTNATT TAACTATAATTGATTTTTCTTGGGTTTTCTGC 364 

579 A GGGCAAC^TGry^flQTr ATnTfy^ACr SGTir AGnTTGCACnAGCT^ figfl 

niifiiiinnnf niii \ mi iif f tiMiiriM i 

365 AGGGCAAGGTGGCGCTGATGTCGGCAAAGAGCATGCACCAGCTGCACATT 414 
629 nGATcyrTnaTCu^rc^a^ 678 

Minniniiiiiii ii iiiiiiii iiiiii n iniMini 

415 TTCATCTTOGTGCTCGCCGTGTTCCATC^TACCTACTGCATCATCACCAT 464 

67 9 AfyrrCTAA ryjCCTWrAAA GTCAGCCT [tCTTCTTCTT .723 

I I II I n II II M if I M I I M If 111 II 

465 GGGTTTAGGGCX;CCTCAAAGTGAGTTTGTCGTTCTGTCCCTCATGCACAT 514 

724 CTTTTACX:.! GCACGTCTGTCTGTCAGGCGTACCTACCTGTTCA 765 

MM f I M : f M I I I f I I f Ml 

515 GTTTTCTCTAGTTCTAGCAANATTGTCAGTCCTTCAAATGGATTGTTTCG 564 

766 TCAGGCTTGAGTAAAACTGTTCCATAATCTGC !tCCGGCATAA 807 

M M M I M MM III f f in 

565 ACA AGAAACXXiAATTTATTAATTTGCCAGTTAAATATATAATAA 608 

808 TCCTCTCCXCCTG r-AGATGAGAArATCGAAGAAATGGGAGACAGAG 853 

I MM I II M f t t If M M If I I I I I I M n 

609 TTGATCTTTCTTGGTTTTAGATGAAGAAATGGAAGAAGTGGGAGTCACAG 658 

854 ArrAnrTf CTTGGAATArnAGTTr^AAATn GTCAGGA 903 

n M 11 1 n n I M I II M M It I i it t i i 

659 ACCAACTCATTGGAGTATCAGTTCGCAATCGGTAGTG . ^ AATTAA 701 

904 CAAtCTCCc! . .CTTCTTCGAAACXyulACC TGATGATCCATTTAAA 946 

llinnt II Mi I MM MM II II Ml 
702 GAATCTCCCTAACTATTTCATTTCAGAAOCTTTATGATAATGTCTTGAAA 751 

947 GAOGCAGGCACGATCAGAGTGAGTGAACTGATGTATGTTCATTTTTTGTG 996 

III I 1 i M i III I MM I 
752 GAGGAGGAGCAAATCAG.CTGAAAAATATGATCGA 785 

997 TCt:TTTCAG ATncrTpr Arvr<yrTrrGGTTCLACGCAcrAGAnGTn^ 1046 
786 TCi^TGclGATa^TTC^ 835 
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1047 AAorfirrrAr-f^oYsry: ryyivrrrirji rsrAryyTnTfyy^ATg^^ 1093 

Mill II mil UN I I III II III I HtlMII III 

836 AAGCQGCATCTGGGATCATTCTCAAGCACOCCTGGGCTCAGATGGATCGT 885 



1094 GAGTTTTTTAGCtTCTTATCTGCCOCTCATCTGTGTGTAATGTT , 

mil I I II I II Ml III I I II 

GAGTTATCAATCTCCGAAT. . . , 



1137 



886 
1138 
929 



. . . .ACATCCTTGTTTTTTATTCTTGCA 928 

, .TGGCGTA." ^TGGAGTCAGGTGATTT. • . I ACCTT 1165 

ml II I I II I IN I I 

ACTGGCCTAGCTOTTCCAATTCAATCCATAaWTTTGAAAAAAAAAATAT 978 



1166 GCCTGTGATGTTTGTTGCCTTGTCAGfiTfyy7rTTCTTCAG(y:AfiTTCT.Tn 1215 
III III IN I m I III N N IN III II IN I N I 

979 TCATGOCGTGTTTG TTGTTAGGTAGCATTCTTCAGGCAGTTCTTT 1023 

1216 A<y?Tf:AgT C ArnAAry;Try;ArTA(^GACCTTGAG(yrCftGGCTTCATC^ 1265 

III I ill III N N IN IN IN IN II N N I i IN N N N I 
1024 GGGTCCGTCACCAAGGTGGACTACCTGACCATGCGGCAAGGCTTCATCAA 1073 

• • • 

1266 CGTACGTGC. . ♦ .CTCCCCTTCTAGCTCCGCCATTGCTGCCGCGATGTAG 1311 

IN I I I IN 11 INN N II I 11 

1074 TGTATATACTAATCAAACCTGACCAATTCAACATTGATGATGC.AAACAG 1122 

• • • 

1312 CAGCAAAGCTTCT CAAGTTATCCTTCTGACGCTAAAGTTCCCA 1354 

II Nil! I INI I I I IN I IN I 
1123 AAGACCAGGTTTTTTTTTTCCGAGTTGTGCAT • TGAAGTTAATG 1165 



1355 TGTTTTTTCCTCAAATTATTCTGCGCAGGCa 
INN I II N I I IINNI 
1166 .GTTTTAGCTTC 



r ATTTfiTr^A AA ACAGC 1403 



N 1 N III N I N IN 
TTCTCTTTTGCAGGCGCCATTTGTCGCAGAATAGC 



1211 



1404 AAfn-rr!f^Af^-rr!nAr-A AiyrArAT(!AAfiAry?TCGATGGAGGACX;ACTTC 1453 

N IN N IN II N I 11 III IN N 1 1 N II I N 11 N II N N 111 
1212 AAGTTCX5ACTTCCACAAATACATCAAGAGGTCTTTGGAGGACGACTTCAA, 1261 

1454 ryyiu^Tn<yrrry;rAT(?AC^ 1503 
il '•••I ININIIN It iJULL^jiJ. 

12 62 V AGTTGTCGTTGGCATCAGGTCCG TCCTCGCTTT • . • 1294 



1504 CACCCCATGGATAG ATTTTAACAATTGCTGTCApGTTCCACATG AT AACA 

M INN I N 1 r IN NN I 
12 95 ATTAATTATAGGA CTCTTATATTCAACATTTTTTTT 

1554 ATATACTATGA.ACTTGGTCrrTTGCTCX:TTCTar^ CACGATCA 

INI I I I N INI I NN N UN 
1331 ATAAAGAAACATATTTAGTCT CCAGTTGTGTATGTGTATGTGGATCT 



1553 



1330 
1597 
1377 



1598 TGACACATTTGGCCTGTTTTCGCAG(XT(yCXKrr(yr^^ l€47 
1378 TGACAcJItTTGG CTGGTTTTGcJ^^ 1426 

1648 frrrAfyr^f^^ MpofrrTnArATTAATC^ 1697 

II I INI ill II III I Nil iill liit IN 
1427 CTTGTACTCTTCCTOGATATCCAOGGTA. •ATCCTTGTCCT ATTT 1469 

1698 CnXnTATTGCTTTGCAGCTAAATAAAACACTTGCAATTC^^ 1747 

I I II III N II I I INI nil INN 
1470 CATTCTTTTTTTTACTCTCAAAACXrrTGTTCTGAATTGGtCTTATAATC^ 1519 

1748 CCGCTCATTCTTCAACCATTTCTTTTTCTACTCATAGfiGCiTTGGCACGCT 1797 

II mm I I N IN I I NN imm N 

1520 CCATCGATTTTTTTTCAACTT ,TTTCOCCGCGTGTAGGTCTTGGCACACT 1568 
1798 r ATrTCy;ATTTrrTTTr JV'rrr'rTf rrCGTGGTAAGTGC . AGATTTCTCC . AT 1845 

II iiiu mil I mm i mil ii i mil ii i . 

1569 TATTTGGATCTCTTTTGTTCCTCTCATCGTAAGAGOGAAATTTCCCCTGT 1618 
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1846 06AAAGCAACAGCAAACCCAATT ^. TGATCGCAAT 1878 

I III! Illir III INI I II IN 

1619 OCAAAGAAACAGTTAACATAATTAATTATGCTTTAATTTATCATGAAAIIT 1668 

1879 GGAAACCCACACCTAATATTAACTCAAAATGTCAATTGTCGGTGCGTCTT 1928 

I 1 II III INI III III 111 I 

1669 TAATATCATCATATiUVCrrAATGIVACAAACATTCA. .TGTGAATGC^ 1716 

1929 ccytCAAr^c :ArnnT(yrrcrrGr^ 1978 

iiiiii nun Mill iiiiiiM Miin i iini 

1717 TTGTCTCAGATGGTCTTGTTAGTTGGGACCAAGCTAGAGATGGTGATCAT 1766 
1979 GGAGATaanrnrGGAGATaaAGGACaGC^CGA(XGTC^^ 2028 

I I I I I I I I I I I 11 I n MUM MM I M 111 Mil II I 
1767 GGAGATGGCXXrAAGAGATACAGGACAGGGCCACTGTGATCCAGGGAGCAC 1816 

2029 c^raGTnakr:nnr.AGnAAnAAGTTnTT(rr 2078 

I MM II M MIIMMIl IMMMIII MIMM IMIII 
1817 CTATGGTTGAACCAAGCAACAAGTACTTCTGGTTCAAOXSCCCTGACTGG 1866 

2079 lyi^CTnTTrrTTrATAriArCTfiArCTTCT ' I I 2107 

Ml I M I M M II II 1 M 1 11 I II 
1867 GTCTTGTTCTTCATACACCTGACACTCTTOOCATGTACATGTTTAAAACC 1916 



2108 nrAnAAcnr. CTTTCACATO^^riGnATTTTG 2136 

MIMMM MMMMIMMMII I 
2017 GACGGACGGATCGATCATCACCAGAAOGCATTTTCAGATGGOSCAT^^ 2066 

2137 TG'PGGAnAmG .GTACGCCAC I . . .CGATGAACTTGTCAGTT 2173 

I Mill 11 Ml M II 1 M M MINI 

2067 TATGGACTATGGTGTGTATGCTACTTGCTTAGTTGTTGCCATTAT 2116 

2174 AACATGGGTGTCA...AGGCACCGAGTGCCX;CTGATGA....! 2208 

II 1 MM I I MM II It MM 

2117 CTTAAGCAAATTAAGTGTGATGCATGCAGTGA CTAATCAGACAA 2160 

2209 . .ACTGCTCTGACGGAGATTTACTTGTGTTqr. I AQGQC 2243 

M Ml I Ml I MM MM MM 
21^1 AAAATGACACAGCTTGTTCATOSATCTGGTTGTTT^ 2210 

2244 Aaacyy^GGc rrrnAAGAAAnrtr^ArrjKnArG 2293 

M M M IIIIIMMMI Ml III MMIIIMII I 

2211 ACAOCTGGTCTGAAGAAATGCTTCCATGAAAATATTTGGCTGAGC^ 2260 

2294 CAAr;CTYyyivymwyyrrA 2343 

I I M I M II M I II II M I I M II 11 I I I I 1 1 1 1 M 
2261 GGAAGTCATTGTGGGGATCTCTCnCAGGTGCTATGCAG^ 2310 

2344 TC^nfrrnvArGnar^<^ n^^ 2389 

MM MMIMIMMMMMMMI I 11 11 I M M 
2311 TCCOGCTCTACGCGCTCX^TCACACAGGTGAACAAGCCATTCACAAA 2356 
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295 fiAnrTnATc;r:T(y;Tf;nr;r!TTCA TATrrnrTfifyrrr!Tr!ATCGTCACGCAGGA 344 
1 GAGCTCHTGCTG^ 50 

345 C<^rCATr:ATCGrrAAGA TATry!ATr:TCr!GAGGATGCrCGCCGACflTCATGT 394 
II I 11 Ml III linillll UN INI III: :MI 
51 TCC. • .CGTCTCCAGGATCTGCATCTCCAAGGAGGCCGGCGANAANATGC 97 

395 GGTicrTGCAAG. ^ r aronnArnGAGGGCCGCAAGCCCA 430 

II I I I I I I : M I I I I I II lilt II 

98 TCCCGTGCAAGCCTTACNACGGCGCCGGCGGTGGCAAAGGCAATGACAAT 147 

431 - GrAACTArf:TTfiArTArTGrrnc;GA 455 

111: II: I I I I I I I 
140 CACCGGAGGCTTCTCTGGCTCCAAGGCGAHAGCGANACCCACCGCCGGTT 197 

456 iaGTGAGCAGCAGAGCCCGG ACCAG ! 479 

HUM I I I I II : t 
198 CCTG . GCTGCCCCGGCCGGANTGGACGTCTGCGCC AAACAGGTGAGCACC 24 6 

480 CAGCTTCACGATGATGAAGAAA.TCAATACCX3AACTTTTTCTTGTTTTCT 528 

1:1 11:1 I I I Ml I : N Mil III II 
2 47 TANCGTCNCCACAAACCACAAACTANCTAATGAGCATGGACCTGAATTTC 296 

52 9 TCTGATTGTCGTCTTGGCTTGGCTTAATTGGTGTGTGTGTGTGTGTTTGC 578 

I II I I M II I II II M M II I II I I I I 

297 TTCTCTTCTTGGCTTGGCTTGACTAAATTGGT TGTGC 333 

579 A GGGrAAGGTGGrGrTrATGTrr Ar.GGGrAnrTTGrArrAGrTGrACGTC 628 

I M M M I M I M M M I M : : M M Ml I M I M I 11 M II 1 
334 ACGGCAAGGTGGCGCTGATGTCNNCGGGAANCATGCACCAACTGCACATA 383 

629 TTrATCTTCrGTGCTnGrGGTrTTr rATGTrACnTACAGrrGTCATCACCAT 678 

M M I 11 M I M M 1 I I I I II M I I III M M I M I I I M 1 II I 

384 TTCATCTTCGTGCTCGCCGTCTTCCACGTCTTGTAC AGCG TCGTC ACC AT 433 

67 9 AGrTnTAAnrrCTrTnAAA GTGAGCCTTTGCTTCTTCTTCTTCTTCTTTT 728 

I I M I II M I M t I 1 M I II 11 I II 
434 GACCCTAAGCCGTCTCAAAGTGAGCATCATACTC 467 

729 ACCGCACGTCTGTCTGTCAGGCGTACCTACCTGTTCATCAGGCTTGAGTA 778 

MM Mill III Ml I t I 

468 GAGCTGTTTGTCAATAATCCTT. . .GGTTTCCAATCCAATTCCA 508 

779 AAACTGTTCCATAATCTGCTCCGGCATAATCCTCTCCTCCTGCAGAiaSA^ 828 

M Ml I I IIIMMM I MMIIMMM 

509 AAGCTGGCACTGATCCTGCTCCGG CTTCCTGCAGATGAA 547 

829 AArATGGAAGAAATGGCSAaArrAGA rrArrArnTnCTTGGAATACCAGTTCG 878 

MMIMII lIMM I IMMI MM MM M IIIMM 
548 GCAATGGAAGAAGTGGGAGTCGGAGACCGCCTCGCTGGAGTATCAGTTCG 597 

879 rAAATG GTCAGGATCCCCCACTCTGCAATCTCCCCTTCTTCGAAACCAAA 928 

I MIIMIII Ml I III I Ml 

598 CGAATGGTCAG CTTCAACTTTTCTTACTGAAA 629 

929 CCTGATGATCCATTT . . . AAAGACGCAGGCACGATCA GAGTGAGT 970 

MUM Mill 1 I 11 11 I It II M I M t III 

630 CCGGATG. . .CATTTACAACAAACGCACGCACGATCAATCATCACAGTGT 67 6 

971 GAACTGAT . GTATGTTCATTTTTTGTGTCCT . TTCAGATCC , ■ TGCACGfi 1016 

M I M I II 1 M MM IIIMM I III 

677 GAGCCGATACGTTGAACCCGATTGAAATCCTCCGCAGATCCCATCGCCGG 72 6 
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1017 TTcrnrysTTr Af-nn Acr: AGACGTCGTT . CGmAAanannAnr rac^acaTCT 1065 
t I I M M ( I N 1 I I I H M 1 MM f I i I Ml M I I It 1 I M t M 
727 TGCCGGTTCACGCACCAGACGACGTTGGGTGAGGCGGCACCTGGGCCTCT 776 

1066 nr.Anr^r nnnTaanA 'rnAar^Taann^fz^'rnkcin^'rrrv^^ 1 1 IS 

I M 1 1 M 1 1 1 I It i I II If 1 1 ri 

777 CCAGCACCCCCGGCGTCAGATGGGT 801 



1166 GCCTGTGATGTTTGTTGCCTTGTCA Gf^TGGrrrrTCTTrAKGCAttTTrTTr 1215 

I I I M I! II I II N II II I I I M II 

602 GGTGGCCTTCTTCAGGCAGTTCTTC 826 

1216 AGGTCAGTCACCAAGGTGaArTAnr!TaA<^TTr;Afy;nrA(yy-T TCATnAA 1265 

t lit M I I I I I N I I I I I It II It I N II I II I I M M II U I 

B27 ACGTCGGTGACCAAGGTGGACTACCTGACCTTGCGGCAGGGCTTCATCAA 87 6 

12 66 CGTACGTGCCTCCCCTTCT AGCTCCGCCATTGCTGCCGCG ATGT AGC AGC 1315 
I 

877 C 877 



1366 CAAATTATTCTGCGCA GGgQgATTTC;Trf;r!AAAArAr;rAAC;TTrnArTTr 1415 

I I I t 11 I I I I I I II II M I I I t I M I 

87 8 GCGCATCTCTCGCAGGGCAACAGGTTCGACTTC 910 

1416 CACAAGTACATCAAGAGGTC G A TG GAGGACGACTTCAAGGTCfiTCGTCGG 1465 

I 1 I I N I I M M I I 1 M t I I I t I I I I I I I t N M M 1 I I M M I I I i 

911 CACAAGTACATCAAGAGGTCGTTGGAGGACGACTTCAAAGTCGTCGTCCG 960 

14 66 CATCAGGTACGTTCCATTCCTTCCTCTGCAC icACACCACAC 1506 

M I f II I M I 1 I I I I I I t N M I I I I 1 I II I 111 

961 CATCAGGTACGCGCCATTCCTTTCTCTGCACAAATTAATACATCCACCAC 1010 

1507 CCCATGGATAGATTTTAACAATTGCTGTCAGGTTCCACATGATAACAATA 1556 

1111:11111 II II: I : I M 

1011 CACATANGTAGATAGATAGA.-. TCGATANATANATTA 1045 

1557 TACTATGAACTTGGTCriTTGCTan'TGTCCTTGCACGATCATGAC^ 1606 

Mill I I I i I I I li I I MM Mi Mill 
104 6 TAC.AAGTGCCGGTACGTAOGTACGTCTCAT.. .ATGATCTTGACACATC 1091 

1607 TGGCCTGTTTTCGCAGCCT<^r:cy^TnTnc;nQTGTCQrr;AT 1656 

II Ml M MM Ml Ml I 11 II M M II M M MM 
1092 TGTCCTCTTGCCGCAATCTCAAGCTCTGGTTCGTGGCGGTCCTCATCCTC 1141 

1657 TTnrrTTG ACATr AATn CTATGGAgCTTCTCC - TCTCCGGTTTCTCTATTG 1705 

M M 11 I I I I i M M II M I II I I I I I I t 
1142 TTCCTTGATTTCGACGGTAGCCGCCTTGTCCATGCCCTGCTCGCCCTCTC 1191 

1706 CTTTGCAGCrAAATAAAACACTTGCAATTCGlXrrCGTGATCACCGC^^ 1755 

M M M I II II M i I M M I M 
1192 CTCCGCTTCTCTCCATAATTTGTG.AACTT6TCCCGT AT 1229 



TTGGCACGCTCATCTfiG 1804 



1756 TTTTC AACCATTTCTTTTTCT ACTC AT AGCififi- 

1 III II I M I M II M I I I M M M Mill 

1230 ATAACC AC ACC ACCGTCGTCTTCTCGC AGGGG ATCGGC ACTCTTCTCTGG 



1279 



1805 ATTTC TTTr ATr rrrr TrnTf^T A AGTGnAn ATTTriTr r ATCG A A Anr a a i854 

MM I 1 M I M It I I II M I t M I M t I I 

12 80 ATGTCCGTGGTTCCTCTCGTGGTAAGTCCA C AATTTGAAT AG A 1322 



1855 CAGCAAACCCAATTTGATCGCAATGGAAACCCACACCTAATATTAACTCA 1904 

III M M 1 I 11 II I M M I M It 
1323 CAACCTGTCCAATTGTGATGTACAGTACCTCCAAACTTAA TTA 1365 
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1949 
1413 



1463 



1905 AAATGTCAATTGTCGGTGCGTCTTCC TCAACAGj 

I I I I I I I I I J II MINI M I I I I 11 11 I f 1 I I 
1366 ACATGTCATTTGCTGAT . . GTCTTGCGTGTAACATTAGATCCTCTTGTGG 

1950 fiTTGGAACCA fifyrrCyTftfiATGfiTrftTrftTl^ftfiftTf^ 1M9 

iini MMMMMMIM I I I I I I M I I M I I i I 1 >> 1:111111 
1414 GTTGGGACCAAGCTGGAGATCGTGATCATGGAGATGGCCCAGGANATCCA 

2000 f^Anror;c;cn; Nfy-(?TCftTrAAGfTGG^ 2049 

tiillll MIIIM lllllll II Mil II II 1 1 I I M 1 i I I I I 
1464 TGACCGTCAGAGCGTC ^513 

205C ftGTTfrrTfrrfifiTTrrftq^fiCCcrfiftfrrfifTfi 2099 

III IIIIIMIli till II I Mill MINIM MM Mill 
1514 AGiACTTCTTCTTC^^ ^^63 

2100 Ar^^crrTC.rAnTKAr.aran^^'rrAaAT ^^r,^^'^'^^^ 2149 

II I MMIMMIIMIMIMMII Mill MUM (M Mill 
1564 ACACtlCTTCCAGAACTC^ 

2150 ACGCCACCGATGAACTTGTCAGTTAACATGGG 2181 

1614 A. . .CaiTACAAGTACTTGTC^ ^^^^ 



2182 



1711 
2224 



TGTCAAGGCACCGAGTGCCGCTG ATGAACTGCTCTGACGG AG 

I M I M M I I i I III Ml 

GACACAAAACTCAATCCAACGCGCGGTAGCAAACGAACGTTTTTCCGTAC 



ATTTAC . 
Ml 1 



.TTG 
I I i 



CCGCTTTCGCCCCATCCCAGCCCAAATTCGTTGACGTTGTTG 



2223 
1760 
2232 
1810 



17 61 GTTTTCGT* 

2233 r^^n^n^^-ri.r ^ ^ ^^r^rnnc^ 2282 
1861 ATGAGCATCGCCAAGGTCGTGCTGGGGGTAGCCTCCCAGATCT^ 



1910 



2333 f-TAT-AT^fiAn^Tfy^fyrrCTArfyfy-^ 2382 

:|| II I MM II I llJMIlM^mMIM 
1911 NTACATCACCTTOXXXrrilTAOGCGCTCGTCAC 



1943 



24 33 AATCATCTGTGTGTGCTGGCTTTGTATGCAG g ^ TrKyiATCAAACftT^ 24 82 

GCAG ATGGGCTC AC AC ATG AAG A 1966 



1944 

2483 r-^nr-ivT-r-TTrnArnAcytAaArGTrrftftGGC ■ ^^T^ft^/^/ft^^T^'^^^^^^ ^531 
I t : M II M M M II M M M I M M Ml M M M M M I M 

1967 



GAAGCaAcTTCGACGAGCAGACGGCCAAGGCGGCTC 2016 



2581 



2532 nj^nn^^rAAk a ^n.^f^n^An^AfKrv^^^ 

I I I I I I I i I M I I M I I M I MIIIM Ml M II II II II M I 
20X7 GATGGCcUgGAGAAGAAGAAGGCCCGAGACGCGG^^ 2066 



2582 AnATaATCc;rr^Arr;rAA rfir^^''^f^<^ft^^'rCGTrfif:CfiATfiCCGAGC 2631 

I t 1 M Mill Ml II M M M M M : M I II M 
2067 AGATGGGCGGCGGCGCGACGCCGAGC6TCGGCTNGTCGCCG 



2107 
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2632 CGGGGCTCATrArr;rr;T fy*^r-rTf;rTTrArAA<?aacATar;c?c;r:GGTCGGA 2681 

I M I M M I n I t I N I i I n 1 I I I I I M 
2108 GTGCACCTGCTCCACAAGGCCGGGGCGCGGTCCGA 2142 

2682 pGArcrccAr;AarGCGr :cr^<^<^'rrnr:r:AAffttArcr:AGCAGGAGGCTAGGG 2731 

llllllllllllli III I II II III M II Mil I ^ I I 
2143 CGACCCCCAGAGCGTGCCGGCGTCCCCGAGGGCCGAGAAGGAAGGCGGCG 2192 

2732 ArAT(;TArrrry:TTCT(y3T<^ry;r Arrrf^arACAGACTAAATCCTAAC 2781 

I III 1 1 1 1 1 1 1 1 1 1 1 II nil 

2193 GC GTGCAGCATCCGGCGCGCAAGGTACCTCCTTGT 2227 

2782 cAnAGGAGGAf^Trrfirrr c^Trr^TrfifirrrrTnGAAGCCGACATCCCCAG 2831 
111 II iiiiiii iMiiiiii INI iini II iiiiiiii I 

2228 GACGGGTGGAGGTCGGCCTCGTCGCCX^GCTCGACGCTCACATCCCCGG 2277 

• • • • 

2832 TGTAGATTTTTrrTTr Anr , - - rAGGGATGAGACAAGTTTCTG 2871 

lllinilll IIIIIII I 111 IIIMIMI II 

2278 TGCAGATTTTGGCTTCAGCACGCAACGTTGACCGATCAGACAAGTTCCTT 2327 

2872 TATT 2875 
IN 

2328 TTTT 2331 
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GCCTCCTCCCCCACCAAACCACACACACAGCACCCTACCTCCCT 

ACCTAGCCTCCGCTTTCTTTTTTTTCCTTTCCCCTCTCTTCCTTCCTCCCCCCCCCCACC 

TCCATACCCCCCCACC CCCAGGC ACCTCCCGGTTGCC TCCCCTGCATCTGCGTCTGCGTA 

CCTCGTAGACCCCCCCGTCTCCTTGCTCCCGC(»ACCAAGCAGCTTCCCGCGCTCGIiCCG 

MSDXXCVPARELPCTP5 twV^lftifcV,4 20 

helix I ^TCTCGCACAAAAAAGGGGTCCCGCCGOCCCACCTCCCCGAGACCCCCTCCTGGGCCaTC 60 

|A*».Y.A.V v ir.>,.A . A M V L V S Y . L Ml E K 6 L B K 40 

CCGCTGGTCTTCGCCCCC ATCG TGCTCGTGTCCGTCCTCATCC AACACGCCCTCCACAAC 1 20 

LGHWrQHARKKAtVEALEKK 60 

CTCCCCCATTGGTTCCACCACCCGCACAAG AAGCCCCTGTCGCAC6CGCTGCAGAACATG 1 80 

K A B Itr-.M:?. L ' V^G I S I. V. L> '.I ^yjUlTilLO^S^Msii BO 

helix 11 AAC6CGGAGCTCATCCTCCTCCCCTTCATATCCCTCCTCCTCATCGTCACCCACCACCCC 240 

1 1 Wir VA -■ KiMi: ' C I ' S\ COAAOVNW'PCXH 100 

ATCA7X:G CCAACATATGCATCTCCGAGGATGCCGCCGACGTCATCTCGCCCT6CAACCCC 300 

GTECHXPSKTVD TCPEGKVA 120 

CGCACC6AGCGCCGCAAGCCCACCAAGTACGTTGACTACTGCCOC6AGCGCAACGTGCC6 3 60 

LMSTGS LRQLn |V T*-. i:»-r ••V-rrt^-'A.^^V^'S^ 140 

hefiX III CTCATCTCCACGGCCACCTTCCACCACCTCCACCTCTTCATCTTCGTCCTCGCCGTCTTC 420 

iHVTTSVITIALl SRLXHRTVK 160 

CATCTCACCTACAGCGTCATCACCATAGCTCTAAGCCGTCTCAAAATGACAACATGGAAG 400 

KWETETTS LETQrANOPAIir 180 

AAATGG6AGACACAGACCACCTCCTTGCAATACCAGTTCCCAAATCATCCTCCACCCTTC 540 

XrTHQTSrvXRHLCLSSTPG 200 

CGGTTCACGCACCAGACCTCCTTCGTGAAGCCCCACCT66CCCTCTOCA6CACCCCTGGC 600 

XRWVVAFFROFFRSVTXVOT 220 

ATCA6ATCGGTCCTCGCCTTCTTCAGCCAGTTCTTCACGTCACTCACCAAGGT6GACTAC 660 

LTLRAGFINAH|.SQNSXrDr 240 

CTGACCTTGAGGGCAGGCTTCATCAACGCGCATTTCTCGCAAAACAGCAACTTCGACTTC 1 20 

HXYIKR5HEDDFK | V • V ' V G ' I..i:S^'n L| 200 

helix IV ^^**CTACATCAAGAGCTCGATGGACGACCACTTCAACGTCGTCGTCCGCATCACJCTC 780 

IPLVG VAI LTLFlI D I N G V G | TXiiL] 280 

helix V ccgctctggcctctcccgatcctcaccctcttccttgacatcaatgccgttgccaccctc 040 

II W I S F 1 P L V I L L -C V Cl T X L S M 300 

ATCTGGATTTCTTTCATCCCTCTCCTCATCCTCTTCTCTCTTGGAACCAAGCTGGAOATC 900 

I XMEHALCZOORASVIXCAP. 320 

ATCATCATGCACATGGCCCTGCACATCCAGGACCGGGCGACCCTCATCAACCCCCCCCCC 9 60 

V V E P S' N KrrwrHRPOVVLFF 340 

CTCCTCGAGCCCACCAACAACTTCTTCTGGTTCCACCCCCCCGACTCGGTCCTCTTCTTC 10 20 

I II LTLFQNAFQMAMFVWTVA 360 

ATACACCTCACGTTGTTCCAGAACGCGTTTCAGATGGCCCATTTTGTGTCGACAGTCCCC 1 080 

TPC LKKCYHTQICISIMX lv--^v| 380 

ACCCCCGGCTTGAAX5AAATCCTACCACACGCAGATCCGGCTCAGCATCATCAAC6TCGTG 1140 

helix VI CTCCTCCT- CCtCTC^^ T T r P L -T . AvV L<^.vl 400 

SqMGSNHXRSXPDEQTSKAL 420 

ACACACAT66CATCAAACATGAAGACCTCCATCTTC6ACGACCA6ACCTCCAACGG6CTC 1260 

TNWRIITAKB |X K X V R| D T O M L M 440 

ACCAACTGGCGGAACACC6CCAACGACAAGAAGAAAGTCCGAGACACG6ACKTCCT6ATC 1320 

AOMIGOATPSRGSSPHPSRC 460 

CCTCACATCATOGCCGACGCAACACCGAGCCGAGGCTCGTCGCCCATGCCGAGCCGGCGC 1380 

SS PVQ LLKKGMG RSDDPQSA 480 

TCATCACCCCTGCACCTGCTTCACAAGGGCATGCGCCGGTCGGACCACCCCCAGAGCCCG 1440 

PTSPRTQQEARDMTPVVVAB SOO 

CCCACCTCGCCAAGGACCCAGCAGG AGCCTACGGACATCTACCCCGTTCTCCTGCCGCAC 1 500 

PVIIRLHPKDRRRSASSSALS 520 

CCGGTGCACAGACTAAATCCTAACGACACGACGACCTCCCCCTCGTCGTCCGCCCTCGAA 1560 

ADlPSADrSrCQC" 
CCCGACATCCCCACTCCACATTTTTCCTTCACCCAGCCATCACACAAGTTTCTCTATTCA 

TCTTAGTCCCAATCTATACCCAACATACUATfiTGATGATTCGTAC AATAACA AATACAAT 
I t 

TTTTTAC TC AG TC 
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1 GAATTCAATT AAGGACAACA ACGGATGXTA GGCTTAXGCT AGAGAGGATT 

51 CATATCGATT AATTAACTGT ACTTAAGTTG AGGTAAAACT CTATCGATTG 

101 CTTTGGACAC CGCCTCTCCC ATGATCTGCC AAGTTGAGCC GGCCTACCTA 

ISl ATTTTCTTCG AAAGCA.CA.CA ACAAACGAAG GTAACCACTA ATCTAGACAC 

201 CACGCCTAAG TTATCAATTA CTACTCTAGT CTCGCGTAGA A-ACTTCATTC 

251 TTTATGGAGA GTGCTAGTAC TAGACTACTT AATA.TAATAG TAAGCGACAA 

301 ACCCACGACG ATGAGAATGT ACCTCACTTA CGTAGTCAAT TAAGTCGAAA 

351 AGGAAATCTT GAACACTTAC TTTATTAAAG AAGTATTCCC CGAGGTACAG 

401 GAGAGGAGAG CACGCCAATA ACTCCAGCAC TCCTCCGAAA CCTTTCTCAC 

4S1 TCTCTACCCT TTTTCTCCAC ACAACTAAAA TGATGTCTAA TGTATGAAAG 

SOI TGAGTTGTAC TCTATTTTGT TGTGTGTTTG GAAGTGAAAT TAGCTCATCC 

551 ttttatagca acttaatggt cggttgtagg ttgctaatta agtccgtaaa 

601 cactcacaac caccatcgtc aaccaatagg acatcgccac atcatcgaaa 

651 gctgacagtt aggggtccca accctgtttt gtccgaacca agcaaacaac 

701 ctctatctag cacctctctt ctatctctga caagtcggcc catatggcgg 

751 tgcactatgg attaagtcaa tttcagtcgt tttggactgt catgtgggcc 

801 cttccaatcc ttgtgctccc atatgattgg tcgaaagtac atttaattcc 

851 tgggtgagtg ctagaactaa tatgatagat gtgctccggc tcctgggaaa 

901 gaggccactt gacatacttg gggtagtgcc ccaagggtat tccctatcgc 

951 tttttcataa ttttctctct ccaaaatcgg acggaaacaa taaaaaagag 

1001 aggcgatgtt catcggcaaa tatctatttt tttgatagtg tcttccctta 

1051 aaacttgatt tttgcgaaca cttccggcta aaaccatgaa atcagagttc 

llol cttgtaacaa atttaatttg ccta-^ataca aaaaagatcg aatgg.agata 

1151 gcattaaact tgctccatac gaatcatatt agttggaccg taactcatag 

1201 aaaaacttgc aagttggttg acctatcaac cctcttatgt tgaccgtaa^ 

1251 cctc;ttatgc attaagga.tt aagtaccggc agatcgtcac tactcacgaa 

1301 TGCACAAATT TCCGGTAACG TA-GGATGGGA TGAGTTGGTC ACAAACGGGT 

1351 CACCACGTCG CCCAACCTGC CCCGATCGAG CCATTGGCCG GCGATGCACG 

1401 CGCTTTGACA CAGCCGCCCG CCGCCCCCCG GCCCGCCCCC GTTTTTAATA 

1C51 AAAACCGGCC CCCCCCTGTC /nAAGGTGTCA AAGTGTCAAG TGCATCAGAG 

1501 CTAACCTAGC GGTCACCCAG TCAGCTCACC CCGA.GACGCA CCAGGGGATC 

1551 TATCGGATCA TGGCAGGTGG GAGATCGGGA TCGCGGGACT TGCCGGAGAC 

1601 GCCGACGTGG GCGGTGGCCG TCGTCTCCCC CGTCCTCGTG CTCCTCTCCG 

1651 CCGCCATGGA GCACGGCCTC CACAACCTCA GCCATGTACG CGCGCGCGCA 

17 01 CGCGGTGTGC TCATCTCTCG AGTTAATTTG GTTGTTGTTG TTGTTGTGTT 

17 51 CTTGTGACAT CTCAATTAAC ATCCCATCGT GGTCGATCGA TCGCCCTGTG 

IBOl GTGGCGATAC TGCTTGCATT GCAGTGCTTC CCTAGGCGGC AGAAGAAGGC 

16 51 CATCXSGCGAC GCCCTCGACA .AGATCAAJ>iGC AGGTCACCCT CAGCCTCAGC 
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1901 


TCACCCTCAG 


CCTCCATCTC 


TAAATATTTG 


ACGCCGTTGA 


CTTTTTTAAA 


1951 


TATGTTTGAC 


CATTCGTCTT 


ATTTAAAAAA 


TTTAAGTAAT 


TATTAATTCT 


2001 


TTTTCTACCA 


TTTGATTCAT 


TGCTAAATAT 


ACTATTATGT 


ATACATATAG 


2051 


TTTTACATAT 


TTCACTAAAG 


TTTTTAAATA 


AGACGAATGG 


TCAAACATGT 


2101 


TTAAAAAAGT 


CAACGGCX3TC 


AAACATTTAG 


GAAGAAGAGA 


ATATTATATT 


2151 


GCTGCTCCCC 


TCTAGCCACT 


TTGCTGCCTC 


CCTCGTCATT 


TTTTCAAGTA 


2201 


TTTTACGCAA 


GACTGGTCCT 


CCAAATCAAA 


CGTCACAAAT 


AAGCCATTTA 


2251 


TAGTTTCCTT 


TCGCTTTTTA 


AGGGGGACTA 


CTTGTATTTA 


ATCATGGAGG 


2301 


AAACTACCAG 


TCGGATGTCC 


GATTACTTAA 


AAAAAAATTC 


GGGGGACTAA 


2351 


TTTTTTTGGC 


TGATCATCGG 


TGAAATATTA 


GGTTATATAT 


GTTGAAAAAA 


2401 


AATCAGCCAC 


AAACAATGAA 


ATATTTTGTG 


AAACACATAT 


TAGACACGTT 


2451 


GAAACGTATC 


ATTGTTACGT 


ATAAAACATC 


GAATGTTAAC 


AGATTAAAAC 


2501 


ATATGTTTTT 


TTTTAATCAG 


AATATAATCA 


TGCGATATAT 


TATTGTAAAG 


2551 


ATATAATTAC 


AACGAATACA 


ACAGTGCGAT 


CGGATTATAT 


ATATATTAGT 


2601 


AGTTTAAGAG 


AAAAATCATT 


TTGAAGATTA 


CTAGATACAT 


ACACGTATAG 


2651 


ATGGATGAAG 


TGGAGAGAGA 


TTAGAGATAA 


GTAGTTATAT 


GAATTTTGTG 


2701 


AAACACACTT 


AAGACATATG 


TTCAAACATA 


CTGCTATTAT 


GTATGAAATA 


2751 


TTGAGTTTTA 


ACGGTTTAAA 


ACACATATTC 


TTTTAATTAG 


AATGTAATAA 


2801 


TGTGATATCT 


TGTTGTAAAA 


TTTAATTACA 


TCTAATATAA 


CGGTGTGATT 


2851 


AGATTGTATG 


TTGGATAACA 


TGCCCATCGG 


TTGGCTTATT 


TAGGGAATAA 


2901 


GCCAAATGGT 


ATATTTGCAA 


ACGAAAAATA 


ATTTGTAAAT 


AAAACTTTTA 


2951 


TGTATGTATT 


CTTAACGATC 


TAGCAGCAAA 


GGCTGAAAAA 


TAAACTTCGA 


3001 


TGAAAAATCT 


CAAAATCAAC 


TCTTAAAATT 


TAAATTTTGG 


CTTATAAGTA 


3051 


TAGTTCCTAA 


CTAGTTTAGA 


AGAAAAAATA 


TTTAAAGCGG 


GGAAGAGGAA 


3101 


AAGGAATAAA 


CTAATAGCTA 


AATTATTGCA 


TGCATGTAGC 


GATTTGAGGA 


3151 


CGACCGAGTT 


GTTTTGTCTG 


GATCAGCCGA 


CCGAGACAGA 


GCAATCTTCT 


3201 


TTAATCATAA 


ATAACCAGAA 


AAACCATACC 


AGTTCATCAC 


AATGGACCGA 


3251 


GTCAGAGTCA 


TTACATATTT 


TTCATTGTTG 


CGCACAGQAT 


TCACCATGTT 


3301 


CTTATGGGAA 


ATATTTTTAA 


CTCTCAAATG 


GTTATGATTT 


TGAACTCTCA 


3351 


TTTTTGAGAG 


AGAATTAACA 


AGCGAGC6AG 


CAATCAGGCC 


AAAAAGGGAG 


3401 


AAAGAAAATT 


A'rrrTTGTTA 


ATimnTTTT 


AAGGTAGGGT 


GGAGGAGTCA 


3451 


TTACATGATT 


1-rrTTTTATA 


TTCCCTCGTT 


GATTATATGC 


TGTTCAAATG 


3501 


GTTATGATTT 


TTTTAAAAGA 


TAACAACAAT 


ACAAATTAGT 


ATGTGATAGA 


3551 


TCATTTCACG 


AGCATATAGG 


ATTAAATTTA 


ACTTCTGTAA 


ATTACAAAAC 


3601 


AAACAAGTTT 


AACTGTTAAT 


ATACATTAAA 


TTTGTTTTTT 


TCAACTTAGG 


3651 


AATTGAATTT 


TATOTATATA 


TTTGTAAAAT 


GATATATTAA 


TTTAU-rrTTT 


3701 


TAAAAAAATA 


ATTATTTAGA 


TAACACGCAA 


ACTAGAAAAC 


CACCGCAGAA 


3751 


GTTCTCATAT 


TTCTTGTCCT 


ATCTGCACTT 


GCAGAGCTGA 


TGCTGCTGGG 


3801 


CTTCATATCC 


CTGCTTCTCA 


CCGTGGCACA 


GGCGCCCATC 


TCCAAGATCT 
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SBOX 


TCAGTTCTTA 


AGCAAATTAA 


38SX 


AAAATGACAC 


AGCTTGTTCA 


S90I 


ACACCTGGTC 


TGAAGAAATG 


S9S1 


GGAAGTCATT 


GTGGGGATCT 


6001 


TCCCGCTCTA 


CGCGCTCGTC 


6051 


ATTAGCCGTT 


TCTTAATTGA 


6101 


OACCATTTGT 


CTTATTAAAA 


61S1 


TATCACTAAA 


AGTACTTTTT 


6201 


CTTTTAATAA 


GATAATGGTC 


6251 


CTATTAAGAA 


AAGGAGGGGT 


6301 


TCAAAATCAG 


TCCAAAACCT 


6351 


CAGTCCCCAT 


AAAATGTCTT 


6401 


GATGCCCTTT 


GTGTTGGTAT 


64S1 . 


GAACATGAAG 


AAGACAATTT 


6501 


ACTGGAGGAA 


GA».GGCGATG 


6551 


TTCCTGGCGC 


AGATGAGCGT 


£ it n 1 




1 v»C AC C. U. 


6651 


CGAGCCCAAT 


CACGGTGGCC 


67 0.1 


CGGTGCCGGC 


GOCGGCTGCG 


6751 


AGGAGGTGGA 


TGGCATCCTC 


6801 


CTTCAGCGCA 


CAACCGTGAC 


6851 


ACCAAACATA 


GGAGTTTAAT 


6901 


TATTGTGCGC 


GCACTTATAT 


6951 


GACAAGGTGA 


TGC?.TGCTGT 


7001 


AAAACTTACT 


CCCTACTTAA 


7051 


CGTCTTATTT 


AAAAfiJiTTTA 


7101 


AAGTACTTTT 


AGTGATAAAA 


71.51 


TAATTTTTTT 


TAATAAATCG 



GTGTGATGCA TGCACTGACT AATGAGACAA 
TCGATCTGGT TGTTTTGTGT GTGACAGGCA 
CTTCCATGAA AATATTTGGC TGAGCATCGT 
CTCTTCAGGT GCTATGCAGC TAjCATCACCT 
ACACAGGTGA ACAAGCCATT CACAAATTCT 
TGACACTGTT AATTTTTAGA CACACGTTTT 
ATATTTATGT AATTATCATT TGAGTTGTTT 
AAATAATTTA TA.TTTTGCAT TTGTACAATT 
AAACATGTGT CCAAAAGTTA ACAGCATCAT 
TTTTTTTTTT TGGAATTTTG CAAAATTTGT 
TTTTTTTTTT CGAAATTTCA GTTTCACTAC 
TTCTTTATTT CCACAAGATT GAACCCATGA 
GTGTTTTGGC CATCACTTGC AGATGGGATC 
TCGAGGAGCA AACGATGAAG GCGCTGATGA 
GAGAAGAAGA AGGTCCGGGA CGCCGACGCG 
CGACTTCGCG ACGCCGGCGT CGAGCCGGTC 
TGCAGGTCAC AGGGCGGGTC GGACGCCCGC 
TCACCACCGG CACCGGAGGG GACATGTACC 
TCTCGCCAGC TGCTAGACGA CCCGCCGGAC 
GTCGGCCGAC ATCGCCGATT CTGATTTTTC 
GGGGGCGATC GGTTTCTGTA TTGATGCTGT 
ATATATATAA TTGTTACGGT AAAATCTAAT 
TAGTCTTATA GCGCGL^CTGG TTCGTGATTA 
TTAGTTATAA AGatlTATCAG CGCAGCTAAA 
TAGATGACCT CGTTGATTTT TAACATTATT 
TGCAAATGTT TAAAACATAA ATCATGCTTA 
CAACTl^ACAA CAAAATAAAT TATACTTACC 
AATGG 
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1 


TTATACCATG 


TGAGAAAGGC 


TGGAAGCATA 


TGCTCTTAGC 


agggacgcgt 


51 


GCATGTTTAT 


ATAGGAGGCA 


TAAGCCGAAG 


AGATATACAT 


GAGGAGAGGT 


101 


TTAAGATCAG 


TCTATCTTAT 


TTACAGTTTA 


AACACZAAGGA 


GATAGAAAGA 


151 


GATCCTAACC 


TACACATGTT 


ATACAAGTCA 


CGTATAATAC 


AAGAGTTATT 


201 


TCGTCTAACA 


CCCTCCCCTC 


TGATATGATA 


AGTCGCCGGG 


AGAGAGAGAG 


251 


AGTGTGTGGC 


TGCCCTCGCT 


GCACTGCACG 


CACATGTTTA 


CTTCTCCGAC 


301 


TGAAACCACG 


GTGAAACCGG 


CGGCGGTGTC 


GCACTCCCCT 


GACTTTCCTC 


351 


GCCGGGGTCC 


CGTCCGGACA 


ATTAAACCGT 


CTGTACCTGC 


CGGGCJGTCQA 


401 


CCCGATCGTG 


ATGTGGCGCC 


GCTTTGTCTG 


CAGCGAGCTG 


CGTGGCCGAT 


451 


GGCAACAAAA 


CTGCGGTCAC 


ATACATGCAT 


ACCCCGCATA 


CCCCGACGCT 


501 


CACCAGTAAG 


TAGGCTGTGG 


TGCGGCACCA 


CGGGCTCGCC 


GCCATTCATG 


551 


CCATGCATGG 


GCCACCCGCC 


GGCGAAACCG 


CGGCGCTGCT 


GCCTGCCACC 


601 


CCGCCGCCGT 


TGACGAAGAC 


TTCGCCCGGC 


CATCCATAAA 


AGCATGCATG 


651 


GCTTGCTCTC 


ACCGGTCCGG 


CCACACACAC 


CACACTTCAC 


TTCGCCATTC 


701 


GCACCACCGA 


GAGCGTAGCG 


TAACGTGTGT 


TTGAAGTCCT 


ACCATTAATT 


751 


TTGCTGGATC 


GATGGCTGGG 


CCGGCGGGAG 


GTCGGGA6CT 


GTCGGACACG 


801 


CCGACGTGGG 


CGGTGGCGGT 


AGTCTGCGCC 


GOXIATGATAC 


TCX3TCTCCGT 


851 


CGCCATGGAG 


CACGCGCTCC 


ACAAGCTCGG 


CCACGTACGT 


GCTCTCGGTT 


901 


CACTAGTGCT 


TAACTGTTTT 


TGATGTTTTC 


GGGCGTGTTT 


GGTAGCCT6C 


951 


AIGGAGAGTG 


TAT6AGCCCA 


aaagttccx:t 


CCCCGACCCA 


CTTTTCGCTG 


1001 


TTTGGTAGGG 


TGTATGGGCT 


GAGGAGAGCA 


TGCATCAACT 


GATGCAAAAA 


1051 


GGGCCTCAGC 


ATAGCTGAGC 


CCAGCACCCC 


CGCAGAGGCG 


AGCTGAGGCG 


1101 


AGTTATX3CTG 


AGCCCATGCA 


CCCTCGCCCC 


gtcgcccx:gt 


CGCCCCGTCG 


1151 


CTCCCCCCCT 


GCACCTCTTC 


CTCCTCCCTC 


TTCCTACCAA 


ACACAGTCTC 


1201 


ATCCAAACAT 


GTAACAACAC 


ATGCATGACC 


ACCAAACAAC 


TGAAGATGAA 


1251 


TGTATTCATC 


ATGTCTATAC 


TTACCATGCA 


tcaacaggga 


ACAACTATGC 


1301 


TAGGGTGAGA 


ACAGCT6CCA 


AACACACCCG 


TGCACCTACT 


CATGCTGTGC 


1351 


CX3GCGCTGGC 


GTACGTGTGC 


AGTGGTTCCA 


CAAGTGGCGC 


AAGAAGGCCC 


1401 

V W 




GCTGGAGAAG 


ATGAAGGCGG 


AGCTCATGCT 


GGTGGGCTTC 


1451 




TCCTCATCGT 


CACGCAGGAT 


CCCGTCTCCA 


GGATCTGCAT 










GTGCAAGCCT 


TACGACGGCG 


1551 


CCGGCX3GTGG 


CA2UIGGCAAG 


GACAATCACC 


GGAGGCTTCT 


CTGGCTCCAA 


1601 


GGCGAGAGCG 


AGACCXy^CCG 


ccx;gttcctg 


GCTGCCCCGG 


CCGGAGTGGA 


1651 


CGTCTGCGCC 


AAACAGGTGA 


GCACCTAGCG 


TCGCCACAAA 


CCACAAACTA 


1701 


GCTAATGAGC 


ATGGACCTGA 


ATTTCTTCTC 


TTCTTGGCTT 


GGCTTGACTA 


1751 


AATTGGTTGT 


GCAGGGCAAG 


GTGGCGCTGA 


TGTCAGCGGG 


AAGCATGCAC 


1801 


CAACTGCACA 


TATTCATCTT 


CGTGCTCGCC 


GTCTTCCACG 


TCTTGTACAG 


1851 


CGTCX3TCACC 


ATGACCCTAA 


GCCGTCTCAA 


AGTGAGCATC 


ATACTCGAGC 
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1901 TGTTTGTCAA TAATCCTTGG TTTCCAATCC AATTCCAAAG CTGGCACTGA 
1951 TCCTCCTCCG GCTTCCTGCA GATGAAGCAA TGGAAGAAGT GGGAGTCGGA 
2001 GACCGCCTCG CTGGAGTATC AGTTCGCGAA TGGTCAGCTT CAACTTTTCT 
2051 TACTGAAACC GGATGCATTT ACAACAAACG CACGCACGAT CAATCATCAC 
2101 AGTGTGAGCC GATACGTTGA ACCGATTGAA TCCTCGCAGA TCCATCGCGG 
2151 TGCCGGTTCA CGCACCAGAC GACGTTGGTG AGGCGGCACC TGGGCCTCTC 
2201 CAGCACCCCC GGCGTCAGAT GGGTGGTGGC CTTCTTCAGG CAGTTCTTCA 
2251 CGTCGGTGAC CAAGGTGGAC TACXTTGACCT TGCGGCAGGG CTTCATCAAC 
2301 GCX3CATCTCT CGCAGGGCAA CAGGTTCGAC TTCCACAAGT ACATCAAGAG 
2351 GTCGTTGGAG GACGACTTCA AAGTCGTCGT CCGCATCAGG TACGCGCCAT 
2401 TCCTTTCTCT GCACAAATTA ATACATCCAC CACCACATAG GTAGATAGAT 
2451 AGATCGATAG ATAGATTATA CAAGTGCCGG TACGTACGTA CGTCTCATAT 
2501 GATCTTGACA CATCTGTCCT CTTGCCGCAG TCTCAAGCTC TGGTTCGTGG 
2551 CGGTCCTCAT CCTCTTCCTT GATTTCGACG GTAGCCGCCT TGTCCATGCC 
2601 CTGCTCGCCC TCTCCTCCGC TTCTCTCCAT AATTTGTGAA CTT6TCCCGT 
2651 ATATAACCAC ACCACCGTCG TCTTCTCGCA GGGATCGGCA CTCTTCTCTG 
2701 GATGTCCGTG GTTCCTCTCG TGGTAAGTCC ACAATTTGAA TAGACAACCT 
2751 GTCCAATTGT GATGTACAGT ACCTCCAAAC TTAATTAACA TGTCATTTGC 
2801 TGATGTCTTG CGTGTAACAT TAGATCCTCT TGTGGGTTGG GACCAAGCTG 
2851 GAGATGGTGA TCATGGAGAT GGCCCAGGAG ATCCATGACC GGGAGAGCGT 
2901 CGTCAAGGGT GCTCCCGCCG TCGAGCCCAG CAACAAGTAC TTCTGGTTCA 
2951 ACCGGCCTGA CTGGGTCCTC TTCCTCATGC ACCTCACACT CTTCCAGAAC 
3001 GCGTTTCAGA TGGCTCATTT CGTGTG6ACA GTGGTACGTA CAAGTACTTG 
3051 TCACTTCACT TAGGCTAACT CCAACAAACG ACCCCAAATT AATGGTCCGT 
3101 CX3CGTCTGTT TGGGGTATGT TTGGGGTAAA CGGACACAAA ACTCAATCCA 
3151 ACGCX3CGGTA GCAAACGAAC 6TTTTTCCGT ACGTTTTCGT CCGCTTTCGC 
3201 CCCATCCXaiG CCCAAATTCG TTGACGTTGT TGCATCGCAG GCCACX3CCCG 
3251 GCTTGAAGAA ATCCTACCAC GAGAAAATGG CAATGAGCAT CGCCAAGGTC 
3301 GTGCTGGGGG TAGCCGCCCA GATCTTGTGC AGCTACATCA CCTTCCCGCT 
3351 CTACGCGCTC GTCACGCAGA TGGGCTCACA CATGAAGAGA AGCATCTTCG 
3401 ACGAGCAGAC GGCCAAGGCX3 CTGACCAACT GGCX3AAAGAT GGCCAAGGA6 
3451 AAGAAGAAGG CCCGAGACGC GGCCATGCTG ATGGCGCAGA TGGGCGGCGG 
3501 C6CGACGCCG AGCGTCGGCT CGTCGCCGGT GCACCTGCTC CACAAGGCCG 
3551 GGGCGCGGTC CGACGACCCC CAGAGCGTGC CGGCGTCCCC GAGGGCCGAG 
3601 AAGGAAGGCG GCGGCGTCCA GCATCCGGCG CGCAAGGTAC CTCCTTGTGA 
3651 CGGGTGGAGG TCGGCCTCGT CGCCGGCX3CT CGACGCTCAC ATCCCCGGTG 
3701 CAGATTTTCG CTTCAGCACG CAACGTTGAC CGATCAGACA AGTTCCTTTT 
3751 TTPrrCGGTG AATAGAAGCG TATCATTTCA TTGATAGACA GTAGAAATTA 
3801 CAGGAATGGC TGTCCTACTA CTATGTACAC AAGGGCACAG CAAAGGATCA 
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Figure 1 O 

1 ATG GCi^.GGTG GGAGATCG^f: ;^.TCGCGCGAG TTGCCGGAGA CGCCGACGTG 

SI GGCGG*1X3GCC GTCGTCTGCG CCGTCCTCGT GCTCGTCTCC GCCGCCATGG 

101 AGCACGGCCT CCACAACC-l"C AGCCATAAAA CCACCGCAGA AGTTCTCATA 

ISI TTTCTTGTCC TATCTCCACT TGCAGAGCTG ATGCTGCTGC GCTXCArATC 

201 CCTGCTTCTC ACCGTGGCAC ACGCGCCCAT CTCCAAGATC TGCATCCCCA 

2 51 AGI-CGGCTGC CAACATCTTG TTGCCGTGCA AGGCAGCCCA AGATGCCATC 

3 01 GAAGAAGAAG CAGCAAGTGG TCGCCGGTCC TTGGCCGGCG CCGGCGGCGG 
351 GGACTACTGC TCGAAATTCG ATGGCAAGGT GGCGCTGATG TCGGCAAAGA 
401 GCATGCACCA GCTGCACATT TTCATCTTCC TGCTCGCCGT GTTCCAT6TT 
451 ACCTACTGCA TCATCACCAT GGGTTTAGGG CGCCTCAAAA TGAAGAAATG 
501 GAAGAAGTGG GAGTCACAGX CCAACTCATT GGAGTATCAG TTCGCAATCG 
S5l ATCCTTCACG ATTCAGGTTC ACGCATCAGA CGTCGTTCGT GAAGCGGCAT 
6 01 CTGGGATCAT TCTCAAGCAC CCCTGGGCTC AGATCGATCG TAGCATTCTT 
651 CAGGCA.GTTC TTTGCGTCCG TCACCAAGGT GGACTACCTG ACCATGCGGC 
701 AAGGCTTCAT CAATGCGCAT TTGTCGCAGA ATAGCAAGTT CGACTTCCAC 
751 AAATACATCA AGAGGTCTTT GGAGGACGAC TTCAAAGTTG TCGTTGGCAT 
8 01 CAGCCTCCCT CTGTGGTTCG TCGGAATCCT ' TGTACTCTTC CTCGATATCC 
8 51 ACGGTCTTGG CACACTTATT TGGATCTCTT TTGTTCCTCT CATCATCGTC 
901 TTGTTAGTTC CGACCAAGCT ACAGATGGTG ATCATGGAGA TGCCCCAAGA 
951 CATACAGGAC AGCGCCACTG TGATCCAGGG AGCACCTATG GTTGAACCAA 

1001 GCAACAAGTA CTTCTGGTTC AACCGCCCTG ACTGGGTCTT GTTTTTCATA 

10 51 CACCTGACAC TCTTCCAT^^J*^ CGCATTTCAG ATGGCGCATT TCGTATGGAC 

1101 TATGGCAACA CCtXSGTCTGA AGAAATGCTT CCATGAAAAT ATTTGGCTGA 

1151 GCATCGTGGA AGTCATTGTC GGGATCTCTC TTCAGGTGCT ATGCAGCTAC 

1201 ATCACCTTCC CGCTCTACGC GCTCCTCACA CAGATGGGAT CGAACATGAA 

1251 GTlAGACAATT TTCGAGGACC AAACGATGAA GGCGCTGATG AACTGGAGGA 

13 03 AGAAGGCGAT GGAGAACAAG rJ\GGTCCOGG ACGCCGACC-C GTTCCTGCCG 

13 SI CAGATGAGCG TCGACTTCGC GACCCCGGCG TCGAGCCGGT CCGCCTCGCC 

14 01 GGTGCACCTG CTCCAGGTCA CAGGGCGGGT CGCACGCCCG CCGAGCCCAA 
14 51 TCACGGTGGC CTCACCACCG GCACCGGAGG AGGACATGTA CCCGGTGCCC 
1501 GCG<;CGGCTG CGTCTCGCCA GCTCCTAGAC GACCCGCCGG ACAGGAGGTG 
1551 GATGGCATCC TCGTCGGCCG ACATCGCCGA TTCTGATTTT TCCTTCAGCG 
1601 CACAACGGTGA 
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Figure 1 1 



1 


ATGGCTGGGC 


CGGCGGGAGG 


TCGGGAGCTG 


TCGGACACGC 


CGACCTGGGC 


SI 


CG*1X5GCGGTA 


CTCTOCGCCG 


TCATGATACT 


CGTCTCCCTC 


GCCATGGACC 


lOX 


ACGCGCTCCA 


CAACCTCGGC 


CACTGGTTCC 


ACAAGTGGCG 


CAAGAAGGCXr 


151 


CTGGGGGAGG 


CGCTGGAGAA GATGAAGGCG 


GAGCTCATGC 


TGCTGGGCTT 


201 


CATATCCCTG 


CTCCTCATCG 


TCACGCAGGA 


TCCCGTCTCC 


AGGATCTGCA 


251 


TCTCCAAGGA 


GGCCGGCGAG 


AAGATGCTCC 


CGTGCAAGCC 


TTACGACGGC 


301 


GCCGGCGGTG 


GCAAAGGCAA 


GGACAATCAC 


CGGAGGCTTC 


TCTGGCTCCA 


351 


AGGCGAGAGC 


GAGACCCACC 


GCCftGTTCCT 


GGCTGCCCCC 


GCCGGAGTGG 


401 


ACGTCTGCGC 


CAAACAGGGC 


AAGGTGGCGC 


TGATCTCAGC 


GGGAAGCATG 


451 


CACCAACTGC 


ACATATTCAT CTTCGTGCTC GCCGTCTTCC 


ACGTCTTGTA 


501 


CAGCGTCGTC 


ACCATGACCC 


TAAGCCCTCT 


CAAAATGAAG 


CAATGGAAGA 


551 


AGTGGGAGTC 


GGA.GACCGCC 


TCCCTGGAGT 


ATCAGTTCGC 


GAATGATCCA 


601 


TCGCGGTGCC 


CCTTCACGCA 


CCAGACGACG 


TTGGTGAGGC 


GGCACCTGGG 


651 


CCTCTCCAGC 


ACCCCCGGCG 


TCAGATGGGT 


GGTGGCCTTC 


TTCAGGCAGT 


701 


TCTTCACGTC 


GGTGACCi^'-AG 


GTCGACTACC 


TCACCTTGCG 


GCAGGGCTTC 


751 


ATCA^.CGCGC 


ATCTCTCGCA 


GGGCAACAGG 


TTCGACTTCC 


ACAAGTACAT 


801 


CAAGAGGTCG 


TTGGAGGACG 


ACTTCAAAGT 


CGTCGTCCGC 


ATCAGTCTCA 


851 


AGCTCTGGTT 


CGTGGCGGTC 


CTCATCCTCT 


TCCTTGATTT 


CGACGGGATC 


901 


GGCACTCTTC 


TCfTGGATGTC 


CGTGGTTCCT 


CTCGTGATCC 


TCTTGTGGGT 


951 


TGGGACCAAG 


CTGGAGATGG 


TGATCATGGA 


GATGGCCCAG 


GAGATCCATG 


1001 


ACCGGCAGAG 


CGTCGTCAAG 


GGTGCTCCCG 


CCGTCGAGCC 


CACCAACA.»G 


1051 


TACTTCTGGT 


TCAACCGGCC 


TGACTGGGTC 


CTCTTCCTCA 


TGCACCTCAC 


1101 


ACTCTTCCAG 


;i-?vCGCGTTTC 


AGATGGCTCA 


TTTCGTGTGG 


ACAGTGGCCA 


1151 


CCCCCGGCTT 


G;^J*.G AAii.TGC 


TACCACGACA 


AAATGGCAAT 


GAGCATCCCC 


1201 


AAGGTCGTGC 


TGGGGCTAGC 


CGCCCAGATC 


TTGTGCAGCT 


ACATCACCTT 


1251 


CCCGCTCTAC 


CCGCTCGTCA 


CGCAGATGGC 


CTCACACATC 


AAGAGA\GCA 


1301 


TCTTCG/.CGA 


GCAGACGGCC 


AAGGCGCTGA 


CCAKCTGGCG 


AAAGATGGCC 


1351 


AAGG/vGAAGA 


AGAAGGCCCG AGACGCGGCC 


ATGCTGATGG 


CGCAGATGGG 


1401 


CGGCCGCGCG 


ACGCCGAGCG 


TCGCCTCGTC 


GCCGGTGCAC 


CTGCTCCACA 


1451 


ACGCrCGGGGC 


GCGCTCCGAC 


GACCCCCAGA 


GCGTGCCGGC 


GTCCCCGAGG 


ISOI 


CCCGAGAACG 


AA.GGCGGCGG 


CGTGCAGCAT 


CCGGCGCGCA 


AGGTACCTCC 


1551 


TTGTGACGGC 


TGGAGGTCGC 


CCTCGTCGCC 


GGCGCTCGAC 


GCTCACATCC 


1601 


CCGCTGCAGA 


TTTTCCCTTC 


AGCACGCAAC 


GTTCA 
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1 


GTTGGTACAT 


AAAAGACTCT 


TCCTTTGTCT 


GTTTTTTGTT 


CCCAGATTCA 


51 


TCTTTACTTA 


TTGACTAAAT 


TCTCTCTGGT 


GTGAGAAGTA 


AAATGGGTCA 


101 


CGGAGGAGAA 


GGGATGTCGC 


TTGAATTCAC 


TCCGACGTGG 


GTCGTCGCCG 


151 


GAGTTTGTAC 


GGTCATCGTC 


GCGATTTCAC 


TGGCGGTGGA 


GCGTTTGCTT 


201 


CACTATTTCG 


GTACTGTTCT 


TAAGAAGAAG 


AAGCAAAAAC 


CCCTTTACGA 


251 


AGCCCTTCAA 


AAGGTTAAAG 


AAGAGCTGAT 


GTTGTTAGGG 


TTTATATCGC 


301 


TGTTACKSAC 


GGTATTCCAA 

V7nJXX*X X \^Xi«>^^^^ 


C3GGCTCATTT 


CCAAATTCTG 


TGT6AAAGAA 


351 


AATGTGCTTA 

X v7 X X XX* 


TGCATATGJCT 


TCCATGTTCT 


CTCGATTCAA 


GACGAGAAGC 


401 


TGGGGCAA6T 


GAAC2ATAAAA 


ACGTTACAGC 


AAAAGAACAT 


TTTCAGACTT 


451 


TTTTACCTAT 


TGTTGQAACC 


ACTAGGCGTC 


TACTTGCTGA 


ACATGCTGCT 


501 


GTGCAAGTTG 


GTTACTCTAG 


CGAAAAGGGT 


AAAGTACCAT 


TGCTTTCGCT 


551 


TGAGGCATTX3 


CACCATCTAC 


ATATTTTCAT 


CTTCGTCCTC 


GCCATATCCC 


601 


<n X w X v7X^Vi»x^ X X 


V« X V3 X VJ7 X X X 


ACCGTGATTT 


TTGGAAGCAC 


AAGGATTCAC 


651 
\j -J- 




A AlVirjrS A f^TJA 

x\xV X vTvTvaiAwVa:'^ 


TTCGATCGCA 


GATGAGAAGT 


TTGACCCCGA 


701 


AAPAnr*TV"*TV^ 


Ar30AAAA(«AA 


GGGTCACfTCA 


TGTACACAAC 


CATGCTTTTA 


751 


TTAAAGAGCA 


X X X X X X^7^7 X 


ATTGGCAAAG 


ATTCAGTCAT 


CCTCGGATGG 


aoi 

W W J> 


ACfST* A ATV^nT 


TTV^r* A Af3P A 
X X w> X \M>xu%v9\_Mn 


ATTCTATGAT 

X%X XV«XX&X^3s^X 


TCTGTGACGA 


AATCAGATTA 


o^x 


X vanV* X X X A 


X V« X X \90 X X 


TYTA'nMV'KSAC 
x^vXVX xx^xvsnw 


ACATTGl'AAG 


GGAAACCCCA 




/VkVV^l xxinx 1 X 


wL*AL>Ank9 X A 1 


A XoA X w\»>\:i 


CTCTAGAGGA 


TGATTTCAAA 






GTAx xACa 1 1 


V7IAIV.I X lv:rtjr 


A X w X X X w X^w 


TVATYTPTTTT 

X N^XXX W X X X X X 






Ofwn Jk 7k l^/*!!/^ A 
O 1 lAiVV^VTis/ll 




X X X \.« X X XX 


GCATTTATTC 

^^^«>XX A ^* A * * 


X\JOX 


X 1 1 111 


1 1\» 1 l\i\^ 1 


r^iySTSn A AO A A 


AOTTGGAGCA 

XX V7 X X \ WXX 


nXSTGATTGCA 




IJ/l\jll 1 Av^v^ 1\«. 


/l. 1 1 X\3^ 


AflArZA A Ar^A'P 


GTAGCCATfKa 

V7 XXX.^7V<^WX^X X%7 


AAGGAGACTT 


1 1 PI 


Jva Lxaia l\xAAA 




A\9r\.*AX X XwX\7 


nTTPAGf!AAA 


CCTCAAATTG 




TTCTCTACTT 


GATCCATTTT 


A XV«V« X\« X X wl^ 


AGAATGCTTT 


TGAGATTGCG 




•1 1 X X X X X X 


I^ValAX X X\vV9\9X 


TAr'ATArYStST! 


TTCGACTCGT 


GCATTATGGG 


w X 




X A\JaX X\9 X xv> 


r»A A<^ ATTY^rST 


TATCGGGGTC 


TTCATTCAAG 


XO 3 X 


1 \9V» 1 1 1 V9\JaV7 


mrnTk r* Ik /^nn A ^ A 


V» X V7\.«V^ X v« XXX 


ACGCCATCGT 


CTCACAGATG 


X4 V/X 


V9V^AA,V3 ±JVid\^ 1 


iv^A Ar^A A ^nr* 


TATATTOfiAO 
XAX AX X w>varvw 


GAGAATGTGC 


AGGTTGGTCT 


1451 


TGTTNGTTGG 


GCACAGAAAG 


TGAAACAAAA 


GAGAGACCTA 


AAAGCTGCAG 


1501 


CTAGTAATGG 


AGACGAAGGA 


AGCTCTCAGG 


CTGGTCCTGG 


TCCTGATTCT 


1551 


GGTTCTGGTT 


CTGCTCCTGC 


TGCTCGTCCT 


GGTGCAGGTT 


TTGCAGGAAT 


1601 


TCAGCTCAGC 


AGAGTAACAA 


GAAACAACGC 


AGGGGACACA AACAATGAGA 


1651 


TTACACCTGA 


TCATAACAAC 


TGAGCAGAGA 


TATTATCTTT 


TCCATTTAGA 


1701 


GGATCATCAT 


CAGATTTTAG 


CTTCAAGGTC 


CGGTTTTGTG 


GTTTATACAT 


1751 


AAGTTATAGT 


GACTTGATTT 


TTTTGTTTTG 


TTACAAAGTT 


ACCATCTTTG 


1801 


GATTAGAATT 


GGGAAATTGA 


ATCTGTTTGT 


ATATTGTATT 


ATTTGGAACA 


1851 


TTGTGGATGC 


CCATGGATAT 


GTTTCTGTTC 







SUBSTITUTE SHEET (RULE 26) 



wo 98/04586 



PCT/GB97/02IM6 



20/28 ■ 



Figure 9 cont'd 



3851 TTGATCTTGT TACAAGAGCA GTAGAAAGGG ATTGCTCTCC ATTGATCTTG 

3901 TTAAGTTGTA TGTCACAAAT TGTTGCAGAA AAAAGTGTAT GTCATCCCAA 

3951 CCAAGAGCTG AGTTTGTGAT GXTTCGTGCA ATAAGAATTG CAAGTTTCAC 

4001 CGAJGTCAAAA A.TGAAGCTTC TAAGTACGCA CCAACCAACC GACrTCTTTCA 

4051 TCTCAACAAA AGAACTGTAA ATGGCAATAA TTCTGATAAC ATC6GAAGGG 

4101 AGCTC 
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3851 


GCATCCCCAA 


GTCGGCTGCC 


AACATCTTGT 


TGCCGTGCAA 


GGCAGGCCAA 


3901 


GATGCCATCG 


AAGAAAGAAG 


CAGCAAGTGG 


TCGCCGGTCC 


TTGGCCGGCG 


3951 


CCGGCGGCGG 


GGACTACTGC 


TCGAAATTCG 


ATGTGAGAAT 


AACACCAGCT 


4001 


6CCGGCAAGC 


ACAACCTCGA 


TGCAATAACT 


AATTTAACTA 


TAATTGATTT 


4051 


TTCTTGGGTT 


TTCTGCAGGG 


CAAGGTGGCG 


CTGATGTCGG 


CAAAGAGCAT 


4101 


GCACCAGCTG 


CACATTTTCA 


TCTTCGTGCT 


CGCCGTGTTC 


CATGTTACCT 


4151 


ACTGCATCAT 


CACCATGGGT 


TTAGGGCGCC 


TCAAAGTGAG 


TTTGTCGTTC 


4201 


TGTCCCTCAT 


GCACATGTTT 


TCTCTAGTTC 


TAGCAAGATT 


GTCAGTCCTT 


4251 


CAAATGGATT 


GTTTCGACAA 


GAAACCCAAT 


TTATTAATTT 


GCCAGTAAAT 


4301 


ATATAATAAT 


TGATCTTTCT 


TGGTTTTAGA 


TGAAGAAATG 


GAAGAAGTGG 


4351 


GAGTCACAGA 


CCAACTCATT 


GGAGTATCAG 


TTCGCAATCG 


GTAGTGAATT 


4401 


AAGAATCTCC 


CTAACTATTT 


CATTTCAGAA 


CCTTTATGAT 


AATGTCTTGA 


4451 


AAGAGGAGGA 


GCAAATCAGC 


TGAAAAATAT 


GATCGATCCA 


TGCAGATCCT 


4501 


TCACGATTCA 


GGTTCACGCA 


TCAGACGTCX3 


TTCGTGAAGC 


GGCATCTGGG 


4551 


ATCATTCTCA 


AGCACCCCTG 


GGCTCAGATG 


GATCGTGAGT 


TATCAATCTC 


4601 


CGAATACATG 


CTTGTTTTTT 


ATTCTTGCAA 


CTGGCCTAGC 


TGTTCCAATT 


4651 


CAATCCATAT 


TTTTTGAAAA 


AAAAAATATT 


CATGCCGTGT 


TTGTTGTTAG 


4701 


GTAGCATTCT 


TCAGGCAGTT 


CTTTGGGTCC 


GTCACCAAGG 


TGGACTACCT 


4751 


GACCATGCGG 


CAAGGCTTCA 


TCAATGTATA 


TACTAATCAA 


ACCTGACCAA 


4801 


TTCAACATTG 


ATGATGCAAA 


CAGAGACCAG 


GTTTTTTTTT 


TCGAGTGTGC 


4851 


ATTGAGTAAT 


GGTTTTAGCT 


•I'cri'CTcm:' 


TGCAGGCGCA 


TTTGTCGCAG 


4901 


AATAGCAAGT 


TCGACTTCCA 


CAAATACATC 


AAGAGGTCTT 


TGGAGGACGA 


4951 


CTTCAAAGTT 


GTCGTTGGCA 


TCAGGTCCGT 


CCTCGCTTTA 


TTAATTATAG 


5001 


GACTCTTATA 


TTCAACATTT 


TTTTTATAAA 


GAAACATATT 


TAGTCTCCAG 


5051 


TTGTGTATGT 


GTATGTGGAT 


CTTGACACAT 


TTGGCTGGTT 


TTGCAGCCTC 


5101 


CCTCTCTGGT 


TCGTCGGAAT 


CCTTGTACTC 


TTCCTCGATA 


TCCACGGTAA 


5151 


TCCTTGTCCT 


ATTTCATTCT 


TTTTTTTACT 


CTCAAAACCT 


TGTTCTGAAT 


5201 


TGGTCTTATA 


ATCACCATCG 


AWiriTTTTC 


AACTTTTTCC 


CCGCrGTGTAG 


5251 


GTCTTOGCAC 


ACTTATTTGG 


ATCTCTTTTG 


TTCCTCTCAT 


CGTAAGAGCG 


5301 


AAATTTCCCT 


GTCCAAAGAA ACAGTTAACA 


TAATTAATTA 


TGCTTTAATT 


5351 


TATCATGAAA 


ATTAATATGA 


TCATATAACT 


AATGAACAAA 


CATTCATGTG 


5401 


AATGCCACCG 


TTGTCTCAGA 


TCGTCTTGTT 


AGTTGGGACC 


AAGCTAGAGA 


5451 


TGGTGATCAT 


GGAGATGGCC 


CAAGAGATAC 


AGGACAGGGC 


CACTGTGATC 


5501 


CAGGGAGCAC 


CTATGGTTGA 


ACCAAGCAAC 


AAGTACTTCT 


GGTTCAACCG 


5551 


CCCTGACTGG 


GTCTTGTTCT 


TCATACACCT 


GACACTCTTC 


CATGTACATG 


5601 


TTTAAAACCT 


AAACCTTGCT 


GCTCAACTAC 


AAATAGTACT 


TTATCTTTCA 


5651 


CAATTAACAC 


CTAATTAACT 


AACATAGCAT 


CCATCCATTT 


GTGGCTACTG 


5701 


ATCGATGGGA 


CGACGGATCG 


ATCATCACCA 


GAACGCATTT 


CAGATGGCGC 


5751 


ATTTCGTATG 


GACTATGGTG 


TOTATGCTAC 


TTGCTTAGTT 


GTTGCCATTA 
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1 


MAGGRSGSRE 


IiPETPTWAVA 


WCAVIiVI.VS 


AAMEHGLHNIi 


SHiCTTAEVLI 


51 


FLVIiSAIAEL 


MLLGFISLIilf 


TVAQAPISKI 


C1PKSAAKII< 


IiPCKAGQDAI 


101 


EEEAAS6RRS 


LAGAGGGDYC 


SKFDGKVALM 


SAKSMHQIiHI 


FIFVIAVFHV 


151 


TyCIITMGLG 


RLKMKKWKKW 


ESQTNSIiEYQ 


FAIDPSRFRF 


THQTSFVKRH 


201 


LGSFSSTPGL 


RWIVAFFROF 


FGSVTKVDYIi 


TMRQGFIKAH 


liSONSKFDFH 


251 


KYIKRSIiEDD 


FKWVGISLP 


LWFVGIIiVLF 


LDIHGLGTLI 


WISFVPIjIIV 


301 


LLVGTKLEMV 


IMEMAQEXQD 


RATVIQGAPM 


VEPSNKYFWF 


NRPDWVLFFI 


351 


HIiTLFKNAFQ 


MAHFVWTMAT 


PGLKKCFHEN 


IWLSIVEVIV 


GISLQVLCSY 


401 


ITFPIiYAIiVT 


QMGSNMKXTI 


FEEQTMKAIiM 


NWRKKAMEKK 


KVRDADAFIiA 


451 


QMSVDFATPA 


SSRSASPVKL 


LQVTGRVGRP 


PSPITVASPP 


APEEDMYPVP 


501 


AAAASRQIiliD 


DPPORRWMPlS 


SSADIADSDF 


SFSAQR* 
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1 


MAGPAGGREIi 


SDTPTWAVAV 


VCAVMILVSV 


AMEHAIiHKLG 


HWFHKWRKKA 


51 


LGEAXiEKMKA 


EU^LVGFISI* 


LlilVTQDPVS 


RICISKEAGE 


KMLPCKPYDG 


101 


AGGGKGXDNH 


RRLIiWLQGES 


ETHRRFIAAP 


AGVDVCAKQG 


KVALMSAGSM 


151 


HOLHIFIFVL 


AVFHVLYSVy 


TMTIiSRbKMK QWKKWBSETA 


SLEYQFANDP 


201 


SRCRFTHQTT 


LVRRHIiGLSS 


TPGVRWWAF 


FRQFFTSVTK 


VDYIiTLRQGF 


251 


INAHLSQGNR 


FDFHKYIKRS 


liEDDFKVWR 


ISLKI.WFVAV 


LILFIiDFDGI 


301 


GTL1.WMSVVP 


LVILIiWVGTK 


LEMVIMEMAQ 


EIHDRESWK 


GAPAVEPSNK 


351 


YFWFNRPDWV 


LFLMHLTLFQ 


NAFOMAHFVW 


TVATPGLKKC 


YHEKMAMSIA 


401 


KWliGVAAQI 


LCSYITFPLY ALVTQMGSHM 


KRSIFDEQTA 


KALTNWRKMA 


451 


KEKKKARDAA 


MLMAQMGGGA 


TPSVGSSPVH 


LLHKAGARSD 


DPQSVPASPR 


501 


AEKEGGGVQK 


PARKVPPCDG 


WRSASSPAIiD 


AHJPGADFGF 


STQR* 
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Figure 1 5 



1 


MCHGGSGMSD 




vcTVivAisL AVERcriSnrrG 


TVI«KKKXQKP 


51 


DYEXXiQKVKE 


E1>£LLGFZSIj 


LLTVFQGUS KFCVKEInV1<M 


HKLPCSLDSR 


101 


RB^GASEHKM 




IjPIVCTTRRI* laeexavqvg 


YCSEKGKVPL 


151 




iFirvr-xxsH 


VTFCVIiTVTF GSTRXHQWKK 


WEDSIADEKF 


201 


DPETAtiRKRR 


VTIIVHNHAFI 


KESFtiGXGKD SVTI-GVmjSF 


LKQFYDSVTK 


251 


SDYVTEJEO/SF 


XMTMCKGIiZPK LtYFRKYMMRX ZJEDOFKQWG 


XSWYtWIPW 


301 




HTrFWIAJFIP 


FAXJjIjAVGTK x<euviaoi«ah 


EVAEICm/AXR 


351 


GDDWKPSDE 


HFWFSKPQIV 


LYLIHFXLFQ NAFEIAPFFW 


XIJVTYGFDSC 


401 


IMGOVRYTVP 


RLVICVFZQV 


LCSYSTl/PIiY AXVSQMGSSF 


KKAJCliEENVQ 


451 


VGI.VGWAQKV 


KQKRDtiKAAA 


SNGDEGSSQA GPGPDSGSGS 


APAAGPGACF 


501 


AGIQI*SRVTR 




TPDHNN* 
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